-
BackgroundMy sys-admin has asked me to use SLURM's Job Arrays, however all but one of my workers will die off in the array due to [what seems to be] a naming error. I'm currently using the "fix" specified here. QuestionIs it possible to manually set the Stacktrace,
|
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 12 replies
-
cc @dask/dask-jobqueue-slurm if anyone gets a moment
I don't have experience using |
Beta Was this translation helpful? Give feedback.
-
NOTE: Following dask/dask-jobqueue#480, the following will work: #7070 (reply in thread).To be clear, the solution I found was the following:
So calling the following should work: cluster = JobArrayCluster(..., name="project-${JOB_ID}", job_extra=["--array=0-50"], env_extra=[
"JOB_ID=${SLURM_ARRAY_JOB_ID}_${SLURM_ARRAY_TASK_ID}",
]) NOTE: I've only tested this on a SLURM cluster as it's the only Batch Manager I have access to. |
Beta Was this translation helpful? Give feedback.
-
By the way @ionlights I would be very curious if you can give a rough estimate of the number of very similar jobs that is acceptable to launch on your cluster without job arrays? In other words, on your cluster and without job arrays, is it OK to use Dask-Jobqueue and do
Also I would be curious if you can give the configuration of your |
Beta Was this translation helpful? Give feedback.
NOTE: Following dask/dask-jobqueue#480, the following will work: #7070 (reply in thread).
To be clear, the solution I found was the following:
JobQueueCluster
, like the following:name
intoJobArrayCluster
like the following:project-${JOB_ID}
(the${JOB_ID}
is the important bit).job_extra
to include the syntax for s…