Skip to content

Latest commit

 

History

History
8 lines (4 loc) · 477 Bytes

README.md

File metadata and controls

8 lines (4 loc) · 477 Bytes

Churn Prediction with PySpark

This Jupyter notebook runs through a simple tutorial of how churn prediction can be performed using Apache Spark.

It uses the Python API to perform basic analysis on the Orange Telco Churn Data, generate decision tree models using MLlib and construct a model selection pipeline with the ML package.

The notebook can be viewed here.