I want to calculate a large distance matrix based on higher-dimensional vectors. For instance, I have 1000 instances, each represented by 20 vectors of length 10. The distance between any two instances is given by the mean distance between the 20 vectors associated with each instance. So I want to go from a 1000 by 20
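A minimal sketch of the distance described above, in plain Python. It rests on two assumptions the excerpt does not confirm: the "mean distance" is taken over all 20 × 20 pairs of vectors, and the per-vector distance is Euclidean. The names `instance_distance` and `distance_matrix` are hypothetical.

```python
import math
from itertools import product


def instance_distance(a, b):
    """Mean Euclidean distance over every pair (u, v), u from a and v from b.

    Assumption: all pairs are averaged; the original question may have
    intended a different pairing.
    """
    pairs = list(product(a, b))
    return sum(math.dist(u, v) for u, v in pairs) / len(pairs)


def distance_matrix(instances):
    """Build the symmetric n-by-n matrix of instance-to-instance distances."""
    n = len(instances)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        # Start at j = i: under the all-pairs definition the diagonal is
        # the mean spread within an instance, which need not be zero.
        for j in range(i, n):
            d = instance_distance(instances[i], instances[j])
            matrix[i][j] = matrix[j][i] = d
    return matrix
```

For 1000 instances this pure-Python loop would be slow; it is only meant to pin down the definition before vectorizing or parallelizing it.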
Tag: dask-distributed
Dask: Continue with other tasks if one fails
I have a simple (but large) task graph in Dask. This is a code example. Here SomeIterable is a list of dicts, each of which holds the arguments to my_function. In each iteration b depends on a, so if the task that produces a fails, b can’t be computed. But each element of results is independent, so I expect if one fails,
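One possible way to keep independent chains from failing together is to catch the exception inside each chain and return it as a value. The sketch below uses plain Python with no Dask calls; `run_chain`, `run_all`, `first`, and `second` are hypothetical names standing in for the elided code, and whether this fits the original graph is an assumption.

```python
def run_chain(args, first, second):
    """Compute b = second(first(**args)) for one element.

    Returns (b, None) on success, or (None, exception) if either step
    fails, so a failure stays confined to this element.
    """
    try:
        a = first(**args)
        return second(a), None
    except Exception as exc:
        return None, exc


def run_all(some_iterable, first, second):
    """Run every chain independently and collect (result, error) pairs."""
    return [run_chain(args, first, second) for args in some_iterable]
```

Because `run_chain` is an ordinary function, it could presumably be wrapped with `dask.delayed` or submitted per element, so that one failed element leaves the rest of the results intact.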
Where does Dask store files while running on JupyterLab
I’m running Dask on JupyterLab. I’m trying to save a file in the home directory, where my Python file is stored. The code runs properly, but I can’t find where my files are being saved. So I made a folder named output in the home directory to save the file in, but when I save a file inside it I’m getting
Dask distributed.scheduler – ERROR – Couldn’t gather keys
I created a Dask cluster using two local machines. I am trying to find the best parameters using Dask GridSearchCV, and I am facing the following error. I hope someone can help solve this issue. Thanks in advance. Answer: I also ran into the same issue, and I found it is likely caused by a firewall. Suppose we have two machines, 191.168.1.1