etl-process

There are 96 repositories under the etl-process topic.

imsanjoykb/Data-Science-Regular-Bootcamp
Regular practice in Data Science, Machine Learning, and Deep Learning, solving ML project problems and analytical issues to regularly boost my knowledge. The goal is to help learners with learning resources in the Data Science field.
Language:Jupyter Notebook122 3 16546
taogeYT/pyetl
Python ETL framework
Language:Python105 7 336
AndrejaCH/Movies-ETL
For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retrieve data from different sources, clean and transform it into a useful format and finally load the data into an SQL database where the data is ready for further analysis. The result is an established automated pipeline and a clean data set stored in an SQL database.
Language:Jupyter Notebook27 1 07
Wazzabeee/pyspark-etl-twitter
Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta Lake
Language:Python18 2 05
polakowo/yelp-3nf
3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow
Language:Jupyter Notebook12 2 03
TheCocoTeam/source-watcher-core
This is a PHP project which combines ETL with different strategies to extract data from multiple databases, files, and services, transform it and load it into multiple destinations.
Language:PHP9 2 220
thompson0012/PyEmits
Sugar candy for data scientists. Easy manipulation for time-series data analytics work.
Language:Python8 2 11
Steve0verton/google-maps-geocode-enrichment
This project repository provides a headless module to enrich location data in a database table using the Google Maps Geocode API.
Language:Python7 1 00
AleksaMCode/university-notices-email-notifier
Dynamic website scraper and email notifier.
Language:Python6 2 00
GhazaleZe/CourseShop_DataWarehouse
A data warehouse for an online course shop
Language:TSQL6 2 01
yekhanfir/Satisfaction-Analysis-Solution-For-Phone-Service-Providers
This is a sentiment analysis project that aims to provide better insight into customers' satisfaction based on comments gathered (scraped) from social media, using Google's BERT classification model.
Language:Jupyter Notebook6 2 02
hmignon/P2_BooksToScrape
Scraping BooksToScrape (P2 OC D-A Python): Using the basics of Python for market analysis
Language:Python5 1 02
davideaimar/eth2dgraph
Extractor of Ethereum data to Dgraph format, with utilities to analyse the indexed data.
Language:Rust4 1 13
polarbeargo/udacity-nd027-Data-Modeling-with-Postgres
Udacity nd027 Data Modeling with Postgres
Language:Jupyter Notebook4 2 02
emsalcengiz/data-normalize-with-etl-procesess
I performed various data normalization operations with Python scripts. The target data is in CSV format.
Language:Python3 1 01
NEXTSLIM/The-Music-has-Changed-Extract-transform-load-
We examine two data sets related to the music industry. We extract, transform, and load the data sets in order to create a database and identify insights and trends about the music industry.
Language:Jupyter Notebook3 1 00
aymane-maghouti/HR-Data-Pipeline-Azure
This project is a comprehensive data engineering solution that extracts HR data from a GitHub repository, performs data transformations using Azure services, and creates an interactive HR dashboard using Power BI. The goal is to enable HR professionals and decision-makers to gain insights from the HR data for better workforce management.
Language:Jupyter Notebook2 1 00
caesarmario/data-warehouse-credit-card-applicant-using-pentaho
This repository contains the OLTP, ETL process (using Pentaho Data Integration), and OLAP of a credit card dataset. The dataset is taken from Kaggle (https://www.kaggle.com/rikdifos/credit-card-approval-prediction) and is part of the author's Capstone Project.
2 1 01
LIoccoUMD/ETL-Analysis
This project automates ETL for gym exercise data, predicting safety scores using KNN and optimizing with GridSearchCV. It generates recommendations, statistical summaries, and visualizations to improve gym safety and client retention. Logging ensures transparency.
Language:Python2 2 00
nickjlupu/Movies-ETL
An ETL process for a fictitious streaming service, Amazing Prime, was developed in Jupyter Notebook. The code was then refactored into a Python script to automate the ETL process.
Language:Jupyter Notebook2 1 00
V-MalM/ETL
A case study of Extract, Transform, Load. Documentation includes sources of data, types of data wrangling performed (data cleaning, joining, filtering, and aggregating), and the schemata used in the final production database. Technologies used include Pandas, PostgreSQL, and Jupyter Notebook.
Language:Jupyter Notebook2 0 01
Anurag-kumar-Molankala/Data-Professional-Survey
This Power BI dashboard analyzes survey responses from data professionals, covering key aspects such as salary distribution, job satisfaction, and preferred programming languages. The insights help understand trends in the data industry and what matters most to professionals.
1
Anurag-kumar-Molankala/Sales-Performance-Dashboard
A Power BI dashboard that analyzes sales trends, product performance, customer segmentation, and payment distribution. It uses DAX, time intelligence, and interactive visuals for data-driven insights. The model includes Sales, Product, and Customer tables for in-depth analysis.
1
bhammy27/Fantasy_Football_database_SQL
A desire to win my Fantasy Football leagues led to a realization that I have a passion for Data Analytics. I will create my own database using PostgreSQL and pgAdmin.
1 1 00
buicongdanh/BI_DATH
Practical project for the Information Systems for Business Intelligence course, HCMUS K19
Language:Jupyter Notebook1 1 00
danilosoftwares/BikeServerProcessador
Data Processor
Language:Python1 1 00
DCF0708/Amazon_Vine_Analysis
ETL and analysis of trends in product review data from Amazon Vine.
Language:Jupyter Notebook1 1 01
jacksonpf1/spotify-user-analysis
ETL process and EDA of user top artists & tracks data in Spotify using Spotipy, Pandas, Airflow and Seaborn
Language:Jupyter Notebook1 1 0
keity-p/Processo_de_ETL-_Projeto_Pix
ETL process for two data sets from the Banco Central do Brasil (Central Bank of Brazil), for the Exploratory Data Analysis project on Pix.
Language:Jupyter Notebook1 1 00
NEXTSLIM/The-Music-has-Changed-WEBSIDE
We are going to examine two data sets related to the music industry. We want to extract, transform, and load them in order to identify insights and trends about the music industry.
Language:CSS1 1 00
pzaino/microETL
A simple, reusable, template-based ETL (Extract, Transform and Load) library and framework written in Python
Language:Python1 1 01
SAZZAD-AMT/Informatica-Data-Integration-and-Transformation-Project
This process illustrates how to structure and manipulate relational databases effectively, demonstrating key SQL operations and transformations within an Informatica environment. The provided images and detailed SQL commands serve as a comprehensive guide for implementing and understanding these database management tasks.
1 1 02
ScuderiRosario/CryptoMundo
CryptoMundo is a simple and easy tool to analyze cryptocurrency data in real time which provides a simple and informative dashboard.
Language:Jupyter Notebook1 1 01
seyedmahdiamin1998/ETL_catawiki
ETL : Extract --> transform --> load
Language:Python1 1 00
shogunbanik18/budgetify
Your one-stop destination for managing budgets and gaining financial insights
Language:Python1 1 01
sidgolangade/Python-Scripts-For-ETL-Jobs
This repository hosts a collection of Python scripts designed to work with ETL jobs.
Language:Python1 1 0
