Skip to content

Integrating Airflow with SLURM #24076

Answered by ecodina
ecodina asked this question in Q&A
Jun 1, 2022 · 3 comments · 20 replies
Discussion options

You must be logged in to vote

Hi @potiuk

Thank you for your answer!

We've ended up developing a deferrable operator and a trigger. It first submits the job to Slurm and then it defers itself until the trigger detects a state change / new output from the slurm job's log file. Depending on the slurm's state, we defer the operator again, finish OK or raise an AirflowException.

Since triggers are able to run in a highly-available fashion, we will be able to to restart Airflow for any reason without losing track of the already submitted Slurm jobs.

As for idempotency, in our case rerunning the task overrides the data.

Replies: 3 comments 20 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
20 replies
@ecodina
Comment options

@bejota
Comment options

@ecodina
Comment options

@bejota
Comment options

@bhulsey
Comment options

Answer selected by potiuk
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
10 participants