Skip to content

jairamc/docker-spark-kafka-streaming

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

docker-spark-kafka-streaming

Docker files for starting a containers with Spark, enabled with Kafka streaming and Apache Toree notebook

This is currently a very early attempt at getting this set up. Will clean things up in the coming days and add more documentation. Use at your own risk!

Kafka

See wurstmeister/kafka-docker for details

Spark

See spark/Dockerfile to see how the Spark image was set up. Notice that the Dockerfile copies the spark-kafka-streaming assembly jar into $SPARK_HOME/jars

Apache Toree Notebook

See spark-notebook/Dockerfile to see how the Apache Toree notebook is set up. The notebook is set up to connect to the Spark container in the docker-compose file. It also attaches a notebooks volume that contains a sample Spark notebook.

About

Docker files for starting a containers with Spark, enabled with Kafka streaming and Apache Toree notebook

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published