Data Engineering - Hours With Experts - Final Project
This project ties together several elements critical for a production level Data Engineering project. This includes:
- Kafka to provide streaming data
- HBase as another source for data enrichment
- HDFS for storing output
- Spark for tying all the pieces together
This is an SBT/Scala project, and utilizes the Amazon Public Reviews dataset.