Skip to content

simple-dev-tools/spark-scaffold

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark Scaffold

Spark Scaffold CI

A framework to develop production grade Spark jobs.

The Idea

The key concept of this framework is the Run Context, which made of three major components,

  1. The Spark session - which you have to use it for Transformation and Action
  2. The Parameters (parsed arguments from command input) - which from the spark-submit command
  3. The config - which should have different config for different environments.

This framework helps you to manage those three components and enable you focus on the actual business logic - DataFrame transformation.

Quick Start

To build and run test cases, just simply run,

sbt test

To package the Jar run,

make build-spark-jar

The sbt-assembly plugin to build a fat Jar with dependencies

To run the example Spark jobs as local mode, just run,

make submit-job

Use it in your project

About

A framework to develop production grade Spark jobs

Resources

Stars

Watchers

Forks

Packages

No packages published