airflow-docker
There are 151 repositories under the airflow-docker topic.
coder2j/airflow-docker
Source code for the Apache Airflow Tutorial for Beginners on the Coder2j YouTube channel (https://www.youtube.com/c/coder2j)
Wittline/uber-expenses-tracking
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.
data-burst/airflow-git-sync
Sync DAG changes from Git to Airflow
anastasiia-p/airflow-ml
Airflow Pipeline for Machine Learning
sergio11/lyric_wave_architecture
🎵 LyricWave – AI Music Composer (Proof of Concept) 🎶 A personal project exploring automatic generation of unique MP4 songs. LyricWave blends lyrics with AI-generated melodies and synthetic vocals to experiment with new forms of musical expression. A creative testbed to push your ideas into sound. 🚀🎧
anilkulkarni87/airflow-docker
This is my Apache Airflow local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
mikestack15/orangutan-stem
An open-source project dedicated to constructing robust data pipelines and scalable software infrastructure. We leverage industry-standard tools favored by developers to enhance efficiency and reliability. Uniquely, these pipelines are field-tested on farms across Sumatra, Indonesia, ensuring real-world applicability and resilience.
DatacollectorVN/Airflow-Tutorial
My self-learning about Apache Airflow
abhioncbr/docker-airflow
Repo for building a Docker-based Airflow image. Containers support features such as writing logs to a local or S3 folder and initializing GCP while the container boots. https://abhioncbr.github.io/docker-airflow/
mikeroyal/Apache-Airflow-Guide
Apache Airflow Guide
judeleonard/Prescriber-ETL-data-pipeline
An end-to-end ETL data pipeline that uses PySpark parallel processing to process about 25 million rows of data from a SaaS application, with Apache Airflow as the orchestration tool alongside various data warehouse technologies, and Apache Superset connected to the DWH to generate BI dashboards for weekly reports.
rahul765/Machine-Learning-Pipelines
From data gathering to model deployment. Complete ML pipeline using Docker, Airflow and Python.
jksinghpro/docker-airflow
Docker for Airflow with MySQL as the backend
okzapradhana/etl-flatfile-airflow
Building a data warehouse on BigQuery that takes flat files as the data source, with Airflow as the orchestrator
resdevd/docker-compose-airflow
Quick Docker Compose deployment for Apache Airflow version 2.x. Ideal for local development
ericdaat/data-stack
A starting point for a data stack using Python, Apache Airflow and Metabase.
knowsuchagency/docker-compose-airflow
An airflow deployment configuration with sane defaults
vjgpt/twitter-pipeline
In this project, you will be building a Twitter Scheduler using Apache Airflow on Docker.
pacuna/airflow-docker
Run Apache Airflow using Docker containers
AhmetFurkanDEMIR/airflow-spark-kafka-example
Airflow, Spark and Kafka example
Aiscalate/aiscalator
Tools to streamline Jupyter Notebook Prototypes into robust Data Products
Booss3my/Indeed_scraper
Automated Indeed Job Offer Scraper: Airflow Orchestrated and Scheduled, Data Loaded into PostgreSQL Database
Data-Bishop/Team5-BuildItAll-Data-Platform
This repository contains the codebase for the BuildItAll Big Data Processing Platform, a case study project designed to manage large daily data for a hypothetical Belgian client.
prasadanilmore/Flask-Airflow-ETL
This documentation provides an overview of the tasks completed in this project. The project comprises three key tasks: ETL (Extract, Transform, Load), API development, and Data Orchestration using Airflow. Each task is detailed below along with explanations of design choices and considerations.
rushabh2390/SharesETL
Shares ETL that scrapes historical data for a single share and imports it into Metabase
DunnBC22/Data_Engineering_Projects
This repository includes data engineering projects using Apache Airflow. I hope to add more projects using different technologies soon!
gustschaefer/Twitter-Batch-ETL
ETL that extracts Twitter trending topics daily across several countries, applies transformations with PySpark, and sends the data to Amazon S3, using Apache Airflow as the orchestrator.
saianupkumarp/airflow-docker
Airflow Docker boilerplate with LocalExecutor and CeleryExecutor. #TODO Kubernetes
sreekesh93/airflow_beginners
Start Airflow locally with a basic setup
abhishekbhakat/Turbine
Simple Python script for easy local Airflow deployment with Docker. Packed with additional components. More will be added going forward.
Chinaskidev/ETL-Clima-ElSalvador
MLOps: building a simple ETL using Docker, Airflow and Google Cloud
erik1110/invoice
iThome 15th-ironman (2023) - Creating an Invoice System with Python (Check Your Own Invoices)
kaushil24/llm-finetune-pipeline
Airflow pipeline to finetune LLM on Kubernetes
mpavanetti/airflow
This set of code and instructions is intended to instantiate a compiled environment from a set of Docker images (Airflow webserver, Airflow scheduler, PostgreSQL, PySpark), with a data pipeline that consumes data from a weather API, processes it with PySpark, and stores it in PostgreSQL.
Pirate-Emperor/BigData-Pipeline
BigData Pipeline is a local testing environment for experimenting with various storage solutions (RDB, HDFS), query engines (Trino), schedulers (Airflow), and ETL/ELT tools (DBT). It supports MySQL, Hadoop, Hive, Kudu, and more.
tarmkalavan/selenium-airflow
Using Airflow and Selenium to automate PM2.5 forecasting, from feature scraping (meteorological data from earth.nullschool.net) to reporting.