emr
Here are 338 public repositories matching this topic...
Streaming pipeline using AWS MSK and AWS EMR with Spark, retrieving the data from Twitter Streams API
-
Updated
Sep 10, 2023 - HCL
-
Updated
May 26, 2024 - Jupyter Notebook
Apache Spark - From installation to performing awesome operations in Apache Spark Stack
-
Updated
May 8, 2017 - Python
Group 10 Project, Fall 2020, CS 6240: Large-Scale Parallel Data Processing, Khoury College of Computer Sciences, Northeastern University
-
Updated
Jul 12, 2021 - Scala
Assignment 2 of the course 'Distributed Systems Programming' by Meni Adler. In the assignment we build an application that calculates the probabilities for any word to come after a couple of words, for ANY couple of words in the n-gram corpus (google).
-
Updated
Feb 22, 2022 - Java
Start and monitor jobs on EMR cluster
-
Updated
Mar 15, 2024 - Python
Frontend for a distributed electronic health records system
-
Updated
May 21, 2023 - Svelte
Scripts for provisioning data science tools
-
Updated
May 26, 2018 - Shell
The EMR Helper library tries to help when setting up and managing an EMR cluster.
-
Updated
Sep 2, 2020 - Python
The emrstreaming provider offers continuous deployment functionality for streaming steps into an EMR cluster.
-
Updated
Mar 9, 2023 - Go
Improve this page
Add a description, image, and links to the emr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the emr topic, visit your repo's landing page and select "manage topics."