Skip to content

SLURMCluster - declaring multiple nodes within single sbatch command #7467

Answered by andersy005
mtsokol asked this question in Q&A
Discussion options

You must be logged in to vote

@mtsokol,

As I understand it, dask-jobqueue won't help you in this particular case because dask-jobqueue assumes that settings defined inSLURMCluster() correspond to a single job on one node. As far as dask-jobqueue is concerned passing -N25 or --nodes=25 to SLUMCluster won't make sense.

For executing one experiment I need 600 cores for about an hour.

For workloads like this where you want to submit a job, and you are getting results after an hour, I'd recommend giving dask-mpi a chance. With dask-mpi you should be able to customize how resources are allocated by your batch queueing system (For instance, you can specify -N25, and should be able to get the 600 cores in one big job) and t…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@mtsokol
Comment options

Answer selected by mtsokol
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants