yzpt's Stars
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Avaiga/taipy
Turns Data and AI algorithms into production-ready web applications in no time.
ageron/handson-ml3
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
plotly/dash-sample-apps
Open-source demos hosted on Dash Gallery
kubeflow/spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
google/temporian
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
tbaltrushaitis/cv
:mortar_board: Best in Class modern CV, Resume and Portfolio website template. All-in-One-Page site with simply customizable builder.
airscholar/e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
eren-ck/st_dbscan
ST-DBSCAN: Simple and effective tool for spatial-temporal clustering
dogukannulu/kafka_spark_structured_streaming
Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra
marcel-licence/esp32_usb_midi
ESP32 USB MIDI add-on for arduino synthesizer projects
robinson-wn/k8spark
Demonstration of PySpark in Docker
supergloo/kafka-examples
Now, you may not believe this based on the repo name, but this repo contains Kafka examples. Amazing right!? Have fun out there folks