Skip to content

How do resource requirements of the Scheduler scale in relation to the task and workers? #7515

Answered by chrisroat
joyofdata asked this question in Q&A
Discussion options

You must be logged in to vote

In using the distributed scheduler, I have found that:

  • I need to allocate a full CPU. CPU fluctuates a lot during initialization, running, and completion... but often can be near 100%. Anecdotally (I haven't measured it precisely), the baseline CPU usage seems higher with more workers.
  • The RAM usage is low (sub GB) for small to moderate graphs. But it seems to grow super-linearly for large graphs (10k+ tasks). For 100k tasks, I need 1GB of RAM and for 200k tasks, I need 3.5GB of RAM.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by joyofdata
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants