Skip to content

Latest commit

 

History

History
10 lines (8 loc) · 414 Bytes

README.md

File metadata and controls

10 lines (8 loc) · 414 Bytes

de-hwe-final

Data Engineering - Hours With Experts - Final Project

This project ties together several elements critical for a production level Data Engineering project. This includes:

  • Kafka to provide streaming data
  • HBase as another source for data enrichment
  • HDFS for storing output
  • Spark for tying all the pieces together

This is an SBT/Scala project, and utilizes the Amazon Public Reviews dataset.