I have a function that parallizes another function via multiprocessing pool which takes a dictionary as input. I would expect that the code below just prints the number from 0 to 32. However, the result shows that there are many numbers being printed more than once.
Anybody an idea?
import multiprocessing as mp
import numpy as np
import functools
def test(name, t_dict):
t_dict['a'] = name
return t_dict
def mp_func(func, iterator ,**kwargs):
f_args = functools.partial(func, **kwargs)
pool = mp.Pool(mp.cpu_count())
res = pool.map(f_args, iterator)
pool.close()
return res
mod =dict()
m =33
res = mp_func(func=test, iterator=np.arange(m), t_dict=mod)
for di in res:
print(di['a'])
As everyone is telling you, in general, it is a bad idea to have multiple threads/processes all modifying the same location, and then expecting that location to have the value that your thread gave it.
Your code will run better if all mutating of the shared data structure happens in only one place. So the general plan is: