data-orchestration
There are 27 repositories under the data-orchestration topic.
kestra-io/kestra
:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
Alluxio/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
cubefs/cubefs
cloud-native distributed storage
apache/incubator-graphar
An open source, standard data file format for graph data storage and retrieval.
iam-mhaseeb/Skytrax-Data-Warehouse
A full data warehouse infrastructure with ETL pipelines running inside Docker on Apache Airflow for data orchestration, AWS Redshift as the cloud data warehouse, and Metabase for data visualizations such as analytical dashboards.
jonathanneo/data-aware-orchestration
Data-aware orchestration with Dagster, dbt, and Airbyte
ozkary/data-engineering-mta-turnstile
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
kestra-io/examples
Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services
SAP-samples/btp-data-to-value-workshop
This repo contains a dataset, exercises, and sample code for an end-to-end SAP BTP data-to-value bootcamp covering SAP HANA Cloud, SAP Data Warehouse Cloud, SAP Data Intelligence Cloud, and SAP Analytics Cloud.
astronomer/airflow-provider-fivetran-async
A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran
Alluxio/k8s-operator
An operator for managing an Alluxio system on a Kubernetes cluster
anna-geller/kestra-ci-cd
CI/CD repository template to automate deployments of your production flows
dagster-io/dagster-quickstart
Get started with Dagster ASAP
anna-geller/kestra-terraform-examples
Bring Infrastructure as Code best practices to your data workflows with Kestra and Terraform
stemitom/postgres-pipeline
A simple pipeline infrastructure with an ETL pipeline contained in a Docker environment, using Apache Airflow for orchestration and Postgres for data warehousing
longNguyen010203/Finance-Data-Ingestion-Pipeline-with-Kafka
Develop a real-time data ingestion pipeline using Kafka and Spark. Collect minute-level stock data from Yahoo Finance, ingest it into Kafka, and process it with Spark Streaming, storing the results in Cassandra. Orchestrate the workflow using Airflow deployed on Docker.
kestra-io/data-engineering-zoomcamp
Code for the Data Engineering Zoomcamp course
zpencerguy/superdoppler
Data orchestration repo with Docker deployment
Annielytix/azure-data-factory-data-vault
Working with SCD Type (Change Data Capture) and need a Data Vault model to test Azure Data Factory v2? This code will help!
jasontanx/prefect-learning
Prefect - Data orchestration tool practice & learning
kingabzpro/5-Airflow-Alternatives-for-Data-Orchestration-Tutorial
Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI
ddeutils/ddeapp-flask
Full-Stack Data Orchestration from a YAML template with Flask & HTMX
MostafaNabilll/end2end_pipeline
End to End data engineering project
tanega/data-duck-pond
A poor man's data lake filled with ducks
Wireforce-LLC/m3
☕ Data Orchestrator. Without abstractions
jacquessham/airflow_notes
Repository to store scripts and notes on Airflow
philiporlando/dagster_university
I created this repo to follow along with the examples in the Dagster University Essentials course.