Linear sum assignment (SciPy) and balancing the costs

Question

I am having difficulty using scipy.optimize.linear_sum_assignment to evenly distribute tasks (costs) to workers, where each worker can be assigned multiple tasks. The cost matrix represents the workload of each task for each worker. We want to minimize the total costs of all workers, while evenly distributing the costs of each worker. In this example, we have 3 workers named a,

Accepted Answer

The linear_sum_assignment method doesn’t support constraints or a custom objective, so I don’t think this is possible.However, you could formulate your problem as a mixed-integer linear programming problem (MILP) and solve it by means of PuLP1. In order to evenly distribute the total costs per worker, you could minimize the difference between the maximum and the minimum total costs per worker. Here’s a possible formulation:Sets:- workers = ["a", "b", "c"]- tasks = [1, 2, ..., 12]Variables:- x[w,t] = 1 iff worker w is assigned to task t, 0 otherwise- min_val- max_valModel:min max_val - min_vals.t. # each worker is assigned to exactly n_tasks_per_worker taskssum(x[w,t] for t in tasks) == n_tasks_per_worker for all w in workers# each task can only be assigned oncesum(x[w,t] for w in workers) == 1 for all t in tasks# evenly distribute the taskssum(x[w,t] for t in tasks) <= max_val for all w in workerssum(x[w,t] for t in tasks) >= min_val for all w in workersThe code is straightforward:import pulpimport numpy as npworkers = ["a", "b", "c"]n_workers = len(workers)n_tasks_per_worker = 4n_tasks = n_workers * n_tasks_per_workerc = np.array([[27, 42, 65, 33, 67, 45, 60, 76, 6, 6, 43, 26], [27, 42, 65, 33, 67, 45, 60, 76, 6, 6, 43, 26], [27, 42, 65, 33, 67, 45, 60, 76, 6, 6, 43, 26]])# create the modelmdl = pulp.LpProblem("even_assignment")# decision variablesx = {}for w in workers: for t in range(n_tasks): x[w, t] = pulp.LpVariable(f"x[{w}, {t}]", cat="Binary")max_val = pulp.LpVariable("max_val", cat="Continuous")min_val = pulp.LpVariable("min_val", cat="Continuous")# objective: minimize the difference between the maximum and the minimum# costs per workermdl.setObjective(max_val - min_val)# constraint: each worker gets assigned exactly n_tasks_per_workerfor w in workers: mdl.addConstraint(sum(x[w, task] for task in range(n_tasks)) == n_tasks_per_worker)# constraint: each task can only be assigned oncefor task in range(n_tasks): mdl.addConstraint(sum(x[w, task] for w in workers) == 1)# constraint: evenly distribute the tasksfor i_w, w in enumerate(workers): assignment_cost = sum(x[w,task]*c[i_w,task] for task in range(n_tasks)) mdl.addConstraint(assignment_cost <= max_val) mdl.addConstraint(assignment_cost >= min_val)# solve the problemmdl.solve()# Outputfor i_w, w in enumerate(workers): worker_cost = sum(x[w, t].varValue*c[i_w, t] for t in range(n_tasks)) print(f"costs for worker {w}: {worker_cost:.2f}")This gives mecosts for worker a: 165.00costs for worker b: 167.00costs for worker c: 164.00[1] To be exact, PuLP isn’t a solver, it’s just a modelling framework that can pass MILPs to MILP Solvers like CBC, SCIP, HiGHS etc.

Advertisement

Answer