CRich8's Stars
DataExpert-io/llm-driven-data-engineering
This is a public repository to go over all the LLM-driven data engineering concepts.
martandsingh/ApacheSpark
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.
josephmachado/efficient_data_processing_spark
Code for "Efficient Data Processing in Spark" Course
ankurchavda/streamify
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Zzdragon66/stock-streaming-project
Zzdragon66/university-reddit-data-dashboard
hieuimba/stock-mkt-dashboard
Daily US Stock Market summary with focus on price action & statistics
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
nama1arpit/reddit-streaming-pipeline
A real-time reddit data streaming pipeline for sentiment analysis of various subreddits
marciovrl/fastapi
A simple example of using Fast API in Python.
digitalghost-dev/global-data-pipeline
Code Repository for my 3rd Data Project.
digitalghost-dev/premier-league
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
astronomer/airflow-dbt-demo
A repository of sample code to accompany our blog post on Airflow and dbt.
RSKriegs/finnhub-streaming-data-pipeline
Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more
stavrostheocharis/weather_data_retriever
Retriever of weather data
josephmachado/online_store
End to end data engineering project
farinisgeorge/RealEstateMarketAnalyzer
An API that analyses the housing market real time and presents market opportunities
HoracioSoldman/batch-processing-on-aws
With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset which is available on Transport For London (TFL) website. https://cycling.data.tfl.gov.uk
CemKeskin84/DataEng_Zoomcamp
jackgisby/tfl-bikes-data-pipeline
Processing TfL data for bike usage with Google Cloud Platform.
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!