Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spark-Dev: Enable Spark Development on AAW Notebooks #1883

Open
Souheil-Yazji opened this issue Nov 9, 2023 · 0 comments
Open

Spark-Dev: Enable Spark Development on AAW Notebooks #1883

Souheil-Yazji opened this issue Nov 9, 2023 · 0 comments
Assignees
Labels
good-first-issue Good for newcomers size/M 2-3 days

Comments

@Souheil-Yazji
Copy link
Contributor

Souheil-Yazji commented Nov 9, 2023

A continuation of #1867 dedicated for enabling spark development on jupyterlab notebooks.

Enabling Spark Development on AAW Notebooks

This dockerfile from Jupyterlab contains the installation and setup for spark, but is missing the installation of scala, which doesn't seem to be required. The get-spark-stuff.sh script in aaw-kubeflow-containers seems to pull these dockerfile and it is called in the makefile under generate-Spark. generate-Spark is not called anywhere when building the actual images, since we call generate-dockerfiles, which indicates that this isn't actually doing anything. We''ll want to add a call to generate-spark to produce the spark layer.

Scala can be installed with
pip3 install scala
or possibly with conda/mamba but not with the default channels (I believe bioconda has it).

Adding Spark to the Dockerfiles

https://github.com/StatCan/aaw-kubeflow-containers/pull/551/files

The get-spark-stuff shell script is simply concatinating the dockerfiles, which I'm not really a fan off. The layer also doesn't work out-of-the-box because some upstream files are missing from our docker context.

@Souheil-Yazji Souheil-Yazji added the kind/feature New feature or request label Nov 9, 2023
@Souheil-Yazji Souheil-Yazji self-assigned this Nov 9, 2023
@Souheil-Yazji Souheil-Yazji added good-first-issue Good for newcomers size/M 2-3 days and removed kind/feature New feature or request labels Nov 9, 2023
@Souheil-Yazji Souheil-Yazji changed the title Spark-Dev: Create Demo Spark app using AAW notebooks Spark-Dev: Enable Spark Development on AAW Notebooks Nov 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good-first-issue Good for newcomers size/M 2-3 days
Projects
None yet
Development

No branches or pull requests

1 participant