
How many processors should be used with multiprocessing.Pool?

I am trying to use multiprocessing.Pool to run my code in parallel. To instantiate a Pool, you have to set the number of processes, and I am trying to figure out what that number should be. I understand it shouldn't be more than the number of cores you have, but I've seen different ways to determine what the system has available.

Two methods:

  1. multiprocessing.cpu_count()
  2. len(os.sched_getaffinity(0))

I'm a little confused: what is the difference between the two, and which should I use with Pool? I am working on a remote cluster; the first reports that there are 128 CPUs, but the second gives 10.
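
Here is roughly what I'm running to check, with the values I get on the cluster in the comments:

import os
import multiprocessing

print(multiprocessing.cpu_count())      # prints 128 on the cluster
print(len(os.sched_getaffinity(0)))     # prints 10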


Answer

The difference between the two is clearly stated in the doc:

multiprocessing.cpu_count(): Return the number of CPUs in the system.

This number is not equivalent to the number of CPUs the current process can use. The number of usable CPUs can be obtained with len(os.sched_getaffinity(0)).
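
You can reproduce the discrepancy on any Linux box by restricting the affinity mask yourself. A minimal sketch (the script name is made up), run under taskset:

# show_affinity.py -- run as: taskset -c 0-9 python3 show_affinity.py
import os

# The affinity mask is the set of CPU indices this process may run on
print(os.sched_getaffinity(0))   # {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} under the taskset above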

So even if you are on a 128-core system, your process may have been restricted (for example by the cluster's scheduler) to run on only 10 of the 128 available CPUs. Since the affinity mask is inherited by child threads and processes, it doesn't make much sense to spawn more than 10. Note also that if you don't pass a number, Pool defaults to os.cpu_count(), which here would heavily oversubscribe the 10 usable CPUs. You could, however, try to increase the number of usable CPUs through os.sched_setaffinity() before starting your pool:

import os
import multiprocessing as mp

cpu_count = mp.cpu_count()

# If the current affinity mask allows fewer CPUs than the system has,
# try to widen it to all of them (this can fail, e.g. under a cgroup/cpuset limit)
if len(os.sched_getaffinity(0)) < cpu_count:
    try:
        os.sched_setaffinity(0, range(cpu_count))
    except OSError:
        print('Could not set affinity')

# Size the pool from the affinity mask actually in effect now
n = len(os.sched_getaffinity(0))
print('Using', n, 'processes for the pool')

pool = mp.Pool(n)
# ...

See also man 2 sched_setaffinity.
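
For completeness, a minimal usage sketch of the sized pool (the worker function and data are made up for illustration):

import os
import multiprocessing as mp

def square(x):
    # Made-up worker function, purely for illustration
    return x * x

if __name__ == '__main__':
    n = len(os.sched_getaffinity(0))   # size the pool from the usable CPUs
    # Using the pool as a context manager terminates its workers on exit
    with mp.Pool(n) as pool:
        results = pool.map(square, range(100))
    print(results[:5])   # [0, 1, 4, 9, 16]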
