etl-pipelines
There are 16 repositories under the etl-pipelines topic.
yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
patterns-app/patterns-devkit
Data pipelines from reusable components
level-vc/useful
The open-source Useful SDK. A single Python decorator from the Useful library provides full observability of Python functions within an ETL.
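The decorator-based observability pattern this kind of SDK relies on can be sketched in plain Python. The decorator name (`observe`) and the recorded fields here are illustrative assumptions, not the actual Useful API:

```python
import functools
import time

def observe(func):
    """Hypothetical observability decorator: records call metadata
    (step name, duration, success/error) for each ETL step it wraps."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            result = func(*args, **kwargs)
            status = "ok"
            return result
        except Exception:
            status = "error"
            raise
        finally:
            # Append one event per call, whether it succeeded or failed.
            wrapper.events.append({
                "step": func.__name__,
                "duration_s": time.perf_counter() - start,
                "status": status,
            })
    wrapper.events = []
    return wrapper

@observe
def transform(rows):
    # A trivial ETL transform step used to demonstrate the decorator.
    return [r * 2 for r in rows]

doubled = transform([1, 2, 3])
```

After the call, `doubled` is `[2, 4, 6]` and `transform.events` holds one record describing the step's name, duration, and status.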
Chek0rrdn/DataEngineer_ETL
A project structure for doing and sharing data engineering work.
abrahamkoloboe27/Airflow-Pipeline-Dashboard-Compagnie-Aerienne
Link to the application
angelxd84130/Airflow-ETL
Build ETL pipelines on Airflow to load data from BigQuery and store it in MySQL
ChristianRCanlas/ChristianRCanlas.github.io
e-Portfolio showcasing my personal projects.
EmmanuelEzenwere/DataSift
DataSift auto applies a data pre-processing pipeline to Data Science Projects.
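The "auto-applied pre-processing pipeline" idea can be sketched as an ordered list of steps run over the data. The step names and ordering below are illustrative assumptions, not DataSift's actual API:

```python
def drop_missing(rows):
    # Remove records containing missing (None) values.
    return [r for r in rows if None not in r.values()]

def normalize(rows):
    # Scale the numeric "value" field into [0, 1].
    values = [r["value"] for r in rows]
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1  # avoid division by zero for constant columns
    return [{**r, "value": (r["value"] - lo) / span} for r in rows]

# Registered steps are applied automatically, in order.
PIPELINE = [drop_missing, normalize]

def preprocess(rows):
    """Apply every registered pre-processing step in sequence."""
    for step in PIPELINE:
        rows = step(rows)
    return rows

data = [{"value": 10}, {"value": None}, {"value": 30}]
clean = preprocess(data)
```

Here `preprocess` drops the record with the missing value and rescales the remaining two to `0.0` and `1.0`.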
prneidhardt/Apache-Data-Pipeline
Sparkify project
extralo/loom
Weaving together different threads (services such as image/audio conversion, ETL services, etc.) to enable the World Wide Flow
Guilherme-B/baboon
JSON-driven ETL pipeline framework prototype
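A JSON-driven pipeline typically maps step names in a config file to registered functions and runs them in order. The config schema and step names below are assumptions for illustration, not baboon's actual format:

```python
import json

# Registry mapping step names (as referenced in the JSON config) to functions.
STEPS = {
    "uppercase": lambda rows: [r.upper() for r in rows],
    "dedupe": lambda rows: sorted(set(rows)),
}

# In practice this would be loaded from a config file on disk.
CONFIG = json.loads("""
{
  "pipeline": ["uppercase", "dedupe"]
}
""")

def run(config, rows):
    """Execute the steps named in the JSON config, in order."""
    for name in config["pipeline"]:
        rows = STEPS[name](rows)
    return rows

result = run(CONFIG, ["a", "b", "a"])
```

With this config, `run` uppercases the inputs and then deduplicates them, yielding `["A", "B"]`; changing the pipeline means editing JSON, not code.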
siddarthaThentu/Disaster-Response-Pipeline
A deployed machine learning model that automatically classifies incoming disaster messages into 36 related categories. Developed as part of Udacity's Data Science Nanodegree program.
speedbits/LimitlessETL
A Python- and Spark-based ETL framework. While it operates within the speed limits of frameworks and standards, it offers boundless possibilities.
juniors90/PymaciesArg
An extension that registers all pharmacies in Argentina.
omar-elmaria/airflow_local
This repo contains the DAGs that run on my local Airflow environment. I use the local environment to test my DAGs before deploying them to virtual machines via Kubernetes
SayamAlt/Formula-1-Data-Ingestion-Transformation---ETL-Pipeline
This project demonstrates a complete ETL pipeline for Formula 1 racing data using Azure Databricks, Delta Lake, and Azure Data Factory. It covers data ingestion, transformation with PySpark and Spark SQL, data governance with Unity Catalog, and visualization through Power BI. Designed to showcase real-world data engineering workflows in Azure.