I am using the Python multiprocessing module to spread, say, 10000 steps of a given task over 4 workers using a Pool. The task that is sent to the workers is a method of a complex object. If I understand the documentation correctly, pickle is used to dump and load the object at each step, which means 10000 pickling/unpickling calls. My problem is that the pickled object is quite complex (it contains many aggregations of nested complex objects), so the pickling/unpickling process takes some time. As a result, my job runs much slower with multiprocessing than as a single-process call. My question is the following: is there a way to do the pickle/unpickle process only once per worker instead of once per step?
EDIT: The code I am trying to parallelize has the following (simplified) structure:
import multiprocessing

class Analysis:
    def run_step(self):
        print('run_step')

    def __getstate__(self):
        # Called every time the object is pickled
        print('I dump')
        return self.__dict__

    def __setstate__(self, state):
        # Called every time the object is unpickled
        print('I load')
        self.__dict__ = state

if __name__ == "__main__":
    a = Analysis()
    pool = multiprocessing.Pool(4)
    for i in range(10):
        pool.apply_async(a.run_step)  # the bound method (and thus 'a') is pickled per call
    pool.close()
    pool.join()
Answer
How about using Process instead of a Pool?
If such a structure is feasible for your use case, you can write a worker function that runs whatever target function you require the desired number of times. Then start the workers with multiprocessing.Process, like below:
import math
import multiprocessing

class Analysis:
    def run_step(self):
        print('run_step')

    def __getstate__(self):
        print('I dump')
        return self.__dict__

    def __setstate__(self, state):
        print('I load')
        self.__dict__ = state

def worker(target, num):
    # 'target' is unpickled once when the Process starts,
    # then called 'num' times with no further pickling
    for _ in range(num):
        target()

if __name__ == "__main__":
    a = Analysis()
    proc = []
    proc_num = 4
    runs = 10
    per_proc_run = math.ceil(runs / proc_num)  # A little inaccurate (4 * 3 = 12 runs here) but I am sure you can figure something out :)
    for _ in range(proc_num):
        proc.append(multiprocessing.Process(target=worker, args=(a.run_step, per_proc_run)))
        proc[-1].start()
    for process in proc:
        process.join()
Output:
I dump
I dump
I dump
I dump
I load
run_step
run_step
run_step
I load
run_step
run_step
run_step
I load
run_step
run_step
run_step
I load
run_step
run_step
run_step
This pickles/unpickles only once per worker. You could probably replicate the same thing with a Pool, but I find this more straightforward.
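For completeness, here is a minimal sketch of the Pool variant, assuming you are happy to stash the object in a module-level global inside each worker. Pool's initializer/initargs parameters are real API; the names init_worker, run_one_step and _worker_analysis are hypothetical helpers introduced for illustration. The object crosses the process boundary at most once per worker (when the worker starts), and every task after that reuses the worker-local copy:

import multiprocessing

class Analysis:
    def run_step(self):
        print('run_step')

    def __getstate__(self):
        print('I dump')
        return self.__dict__

    def __setstate__(self, state):
        print('I load')
        self.__dict__ = state

_worker_analysis = None  # hypothetical per-process slot for the object

def init_worker(analysis):
    # Runs once in each worker process when the Pool starts it;
    # this is the only point where 'analysis' is transferred.
    global _worker_analysis
    _worker_analysis = analysis

def run_one_step(_):
    # Each task reuses the worker-local copy; no pickling per step.
    _worker_analysis.run_step()

if __name__ == "__main__":
    a = Analysis()
    with multiprocessing.Pool(4, initializer=init_worker, initargs=(a,)) as pool:
        pool.map(run_one_step, range(10))

Depending on the start method you may see up to four 'I dump'/'I load' pairs (spawn, one per worker) or none at all (fork, where the object is inherited), but never one per step.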