Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
-
Updated
May 25, 2024 - TypeScript
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
A machine-readable, human-editable database of the Yu-Gi-Oh! Trading Card Game, Official Card Game, Master Duel, Rush Duel, Speed Duel.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Monopoly is a Python library that extracts transactions from bank statement PDFs
The open source high performance ELT framework powered by Apache Arrow
Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)
An orchestration platform for the development, production, and observation of data assets.
Airbyte connectors (sources & destinations) + Airbyte CDK for JavaScript/TypeScript
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
Efficient data transformation and modeling framework that is backwards compatible with dbt.
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Add a description, image, and links to the etl topic page so that developers can more easily learn about it.
To associate your repository with the etl topic, visit your repo's landing page and select "manage topics."