WIP for launching batch runs on AWS directly from COVIDScenarioPipeline #223

jwills · 2020-04-20T20:25:50Z

Going to be iterating on this for a bit, but posting it for feedback from folks now.

samshah · 2020-04-20T22:36:00Z

batch/job_launcher.py

+import yaml
+
+@click.command()
+@click.option("-p", "--job-prefix", type=str, required=True,


nit: intelligent defaults to reduce the number of required params would be nice. e.g., job-prefix is name in config file plus a uniqifer, num-jobs is calculated based on # of scenarios/# of sims with some cap, etc.

samshah · 2020-04-20T22:40:30Z

batch/job_launcher.py

+    # Update and save the config file with the number of sims to run
+    print("Updating config file %s to run %d simulations..." % (config_file, sims_per_job))
+    config = open(config_file).read()
+    config = re.sub("nsimulations: \d+", "nsimulations: %d" % sims_per_job, config)


would use pyyaml or the utils.config module to read-modify-write versus regex, this seems error-prone

yeah, I did at first, but the output was a total re-ordering of the entries in the config file, which seemed not great; I wonder if there's a way to keep the ordering of the entries consistent on the read-edit-write cycle?

According to this thread, you need to set sort_keys=False on the dump. Trying it locally, this preserves key order in the file, but strips comments and whitespace.

I feel like we would be okay with that, right?

yeah, i think it's fine

samshah · 2020-04-20T22:43:49Z

batch/job_launcher.py

+            {"name": "DVC_OUTPUTS", "value": " ".join(dvc_outputs)},
+            {"name": "S3_RESULTS_PATH", "value": results_path}
+    ]
+    s3_cp_run_script = "aws s3 cp s3://%s/%s $PWD/run-covid-pipeline" % (s3_input_bucket, runner_script_name)


(aside, don't know if you know about fstrings in python 3.6+, they make subbing variables much easier/neater)

I do not really; will take a look!

samshah · 2020-04-20T22:46:57Z

batch/job_launcher.py

+@click.option("-i", "--s3-input-bucket", "s3_input_bucket", type=str, default="idd-input-data-sets")
+@click.option("-o", "--s3-output-bucket", "s3_output_bucket", type=str, default="idd-pipeline-results")
+@click.option("-d", "--job-definition", "batch_job_definition", type=str, default="Batch-CovidPipeline-Job")
+@click.option("-q", "--job-queue", "batch_job_queue", type=str, default="Batch-CovidPipeline")


nit: for these params, i think you want show_default=True set so when you do a --help you'll see what the default value is

also missing helpstrings :)

samshah · 2020-04-20T22:50:13Z

nice! this looks really good -- thanks for doing it! i only have minor/trivial comments

…a know, actually running them

perifaws · 2020-04-21T01:48:04Z

batch/runner.sh

+	for output in "${DVC_OUTPUTS_ARRAY[@]}"
+	do
+		"Saving output $output"
+		tar cv --use-compress-program=pbzip2 -f $output.tar.bz2 $output


Is compression required? We took that route initially but found that just a sync was sufficient and faster overall.

perifaws · 2020-04-21T01:51:47Z

batch/launch_job.py

+    s3_cp_run_script = f"aws s3 cp s3://{s3_input_bucket}/{runner_script_name} $PWD/run-covid-pipeline"
+    command = ["sh", "-c", f"{s3_cp_run_script}; /bin/bash $PWD/run-covid-pipeline"]
+    container_overrides = {
+            'vcpus': 72,


Should one allow an override on that? If the advanced user wants a different configuration?

jkamins7 · 2020-04-21T19:10:29Z

can we change this to use dev instead of dataseed? trying to replace dataseed as our slightly less stable bracnh

WIP for launching batch runs on AWS directly from COVIDScenarioPipeline

f3755cb

jwills requested review from jkamins7, kkintaro, perifaws and samshah April 20, 2020 20:26

Josh Wills added 3 commits April 20, 2020 14:53

dvc init w/o usage tracking

655b2d1

Always ignore the data/ directory now

dafe067

git should ignore the standard output file directories

5666783

samshah reviewed Apr 20, 2020

View reviewed changes

Josh Wills added 5 commits April 20, 2020 16:19

More batch/dvc workflow updates

bc8a4df

Integrate Sam's feedback

e89e066

f-string all the things

179cc85

Various fixes to the batch/dvc setup and run scripts I found while, y…

74167b5

…a know, actually running them

Add a forcing call for this since I always forget to run it

9b13790

perifaws reviewed Apr 21, 2020

View reviewed changes

Some fixes to make the batch runner script more useful/debuggable

f1930be

jkamins7 changed the base branch from dataseed to dev April 22, 2020 00:36

jkamins7 added 2 commits April 29, 2020 15:54

Merge branch 'dev' into dataseed_batch

f206200

Merge branch 'dev' into dataseed_batch

4e050b6

jwills closed this May 26, 2020

jwills deleted the dataseed_batch branch May 31, 2020 20:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP for launching batch runs on AWS directly from COVIDScenarioPipeline #223

WIP for launching batch runs on AWS directly from COVIDScenarioPipeline #223

jwills commented Apr 20, 2020

samshah Apr 20, 2020

samshah Apr 20, 2020

jwills Apr 20, 2020

samshah Apr 20, 2020

jwills Apr 20, 2020

samshah Apr 20, 2020

samshah Apr 20, 2020

jwills Apr 20, 2020

samshah Apr 20, 2020

samshah Apr 20, 2020

samshah commented Apr 20, 2020

perifaws Apr 21, 2020

perifaws Apr 21, 2020

jkamins7 commented Apr 21, 2020

WIP for launching batch runs on AWS directly from COVIDScenarioPipeline #223

WIP for launching batch runs on AWS directly from COVIDScenarioPipeline #223

Conversation

jwills commented Apr 20, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

samshah commented Apr 20, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jkamins7 commented Apr 21, 2020