Twitter data is extracted using ETL data pipelines and Airflow workflows are implemented.
-
Updated
Feb 15, 2023 - Python
Twitter data is extracted using ETL data pipelines and Airflow workflows are implemented.
Apache Airflow Cheatsheet
Automate Apache Spark in Hadoop with Airflow in Cloud
Process of scheduled data extraction, transform and load is done using Apache Airflow and PySpark
Starter package to setup Apache Airflow locally.
This repo contains the concepts of Apache Airflow and the practical implemetation I'll be doing while learning.
A simple dag for triggering the Cloud Data Fusion Pipeline using Apache Airflow.
An example Apache Airflow DAG-definition source repository, to be used with the Airflow DAG Aggregator.
Setup for Apache Airflow with Docker.
Udacity project within the Data Engineer Nanodegree
The ETL Pipeline using a way autoscaling
This project is the final project of the the Data Pipeline - Udacity module.
ETL pipeline built with Apache Airflow to prepare Uber and Lyft trip data for consumption by a Looker Studio report.
Project in Course of Udacity's Data Engineering Nano-Degree
Celery and Kubernetes operators are used in order to manage data engineering pipelines of stocks and cryptocurrencies prices
Playing around with Airflow
Add a description, image, and links to the airflow topic page so that developers can more easily learn about it.
To associate your repository with the airflow topic, visit your repo's landing page and select "manage topics."