iahsanujunda/ml_sagemaker_studies

Machine Learning Deployment Case Studies with AWS SageMaker

This repository contains code and associated files for deploying ML models using AWS SageMaker. The sample notebooks span several case studies, code exercises, and project files. Together they illustrate parts of the ML workflow and give practice deploying a variety of ML algorithms on SageMaker.

Case Study Samples

  • Population Segmentation: Learn how to build and deploy unsupervised models in SageMaker. In this example, you'll cluster US Census data by reducing the dimensionality of the data with PCA and then clustering the resulting top components with k-means.
  • Payment Fraud Detection: Learn how to build and deploy a supervised LinearLearner model in SageMaker. You'll tune the model and handle class imbalance to train it to detect cases of credit card fraud.
  • Deploy a Custom PyTorch Model (Moon Data): Train and deploy a custom PyTorch neural network that classifies "moon" data, that is, binary-labeled data distributed in moon-like shapes.
  • Time Series Forecasting: Learn to analyze time series data and format it for training DeepAR, a forecasting algorithm that uses a recurrent neural network. Train a model to predict household energy consumption patterns and evaluate the results.
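
As an illustration of the formatting step mentioned in the Time Series Forecasting case study, here is a minimal sketch of turning time series into the JSON Lines layout that DeepAR reads, where each line is an object with a "start" timestamp and a "target" list of values. The helper names and the sample readings are hypothetical and are not taken from the notebooks; the notebook itself may prepare the data differently.

```python
import json
from datetime import datetime


def to_deepar_record(start, values):
    """Build one DeepAR record from a start timestamp and a list of readings."""
    return {
        # DeepAR expects the start time as a "YYYY-MM-DD HH:MM:SS" string
        "start": start.strftime("%Y-%m-%d %H:%M:%S"),
        "target": [float(v) for v in values],
    }


def to_json_lines(records):
    """Serialize records as JSON Lines: one JSON object per line."""
    return "\n".join(json.dumps(record) for record in records)


# Hypothetical energy readings, invented for illustration only
series = [
    (datetime(2007, 1, 1), [1.9, 0.8, 1.1, 1.3]),
    (datetime(2008, 1, 1), [1.4, 1.5, 0.9]),
]

records = [to_deepar_record(start, values) for start, values in series]
json_lines = to_json_lines(records)
print(json_lines)
```

The resulting text could be written to a file and uploaded to S3 as a training channel; that upload step is left out here because it depends on the bucket and session set up in the notebook.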

Project

Plagiarism Detector: Build an end-to-end plagiarism classification model. The skills from the case studies above are applied here to clean data, extract meaningful features, and deploy a plagiarism classifier in SageMaker.

Examples of dimensionality reduction and time series prediction


Setup Instructions

These notebooks are intended to be executed on Amazon's SageMaker platform. Follow the instructions below to provision a notebook instance, from which all notebooks can be run.

Log in to the AWS console and create a notebook instance

Log in to the AWS console and go to the SageMaker dashboard. Click on 'Create notebook instance'.

  • The notebook name can be anything. Using ml.t2.medium is a good idea, as it is covered under the free tier.
  • For the role, creating a new role works fine. Using the default options is also okay.
  • The notebook instance needs access to S3 resources, which it has by default. In particular, any S3 bucket or object with "sagemaker" in the name is available to the notebook.
  • Use the option to git clone the project repository into the notebook instance by pasting https://github.com/iahsanujunda/ml_sagemaker_studies.git
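
For readers who prefer scripting over the console, the settings above can be sketched with boto3's SageMaker client and its `create_notebook_instance` call. This is only a sketch under assumptions: the instance name and role ARN are placeholders, the helper functions are hypothetical, and unlike the console flow the API call needs an existing IAM role rather than creating one for you.

```python
REPO_URL = "https://github.com/iahsanujunda/ml_sagemaker_studies.git"


def notebook_instance_params(name, role_arn,
                             instance_type="ml.t2.medium",
                             repo_url=REPO_URL):
    """Collect the console choices as keyword arguments for the API call."""
    return {
        "NotebookInstanceName": name,
        "InstanceType": instance_type,
        "RoleArn": role_arn,
        # Git repository cloned into the instance when it is created
        "DefaultCodeRepository": repo_url,
    }


def create_notebook_instance(params):
    """Send the request; requires boto3 and configured AWS credentials."""
    import boto3  # imported here so the helper above works without boto3

    client = boto3.client("sagemaker")
    return client.create_notebook_instance(**params)


params = notebook_instance_params(
    name="sagemaker-studies",  # hypothetical instance name
    role_arn="arn:aws:iam::123456789012:role/ExampleSageMakerRole",  # placeholder
)
print(params)
# create_notebook_instance(params)  # uncomment to actually create the instance
```

The call is left commented out so that running the snippet only prints the parameters and does not create billable resources.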

Open and run the notebook of your choice

Now that the repository has been cloned into the notebook instance, you can navigate to any notebook you wish to complete or execute. Additional instructions are contained in each notebook.
