Skip to content
#

apache-airflow

Here are 427 public repositories matching this topic...

A comprehensive data engineering pipeline, orchestrates data workflows with Apache Airflow, Python, Kafka, Zookeeper, Spark, and Cassandra. Containerized using Docker: to deploy and scale effortlessly. This Etsy API Data Pipeline extracts, processes, and analyzes Etsy marketplace data—retrieving product listings, shop details, and reviews.

  • Updated Jan 6, 2024

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

  • Updated Nov 1, 2023
  • Python

Improve this page

Add a description, image, and links to the apache-airflow topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the apache-airflow topic, visit your repo's landing page and select "manage topics."

Learn more