hailampy123's Stars
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
marcel-dempers/docker-development-youtube-series
arp242/goatcounter
Easy web analytics. No tracking of personal data.
tensorchord/Awesome-LLMOps
An awesome, curated list of the best LLMOps tools for developers
ayastreb/money-tracker
💰 Personal finances tracking web app
ankurchavda/SparkLearning
A comprehensive Spark guide collated from multiple sources, usable for learning more about Spark or as an interview refresher.
thinh-vu/vnstock
A powerful Python library for getting rich data from the Vietnam Stock Market using just a few lines of code
ABZ-Aaron/Reddit-API-Pipeline
rittmananalytics/ra_data_warehouse
This dbt package contains a set of pre-built, pre-integrated Load and Transform dbt models for common SaaS applications.
darshilparmar/stock-market-kafka-data-engineering-project
karol-brejna-i/locust-experiments
A series of experiments with the load-testing tool Locust (locust.io)
dogukannulu/kafka_spark_structured_streaming
Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra
SteveHedden/kg_llm
Integrating knowledge graphs (KG) with large language models (LLM)
darshilparmar/apache-spark-with-data-bricks-for-data-engineering
apache-spark-with-databricks-for-data-engineering
abbos0123/Design-Patterns
Realsid/databricks-spark-certification
Guide for databricks spark certification
duyet/realtime-dashboard
Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js
simardeep1792/Data-Engineering-Streaming-Project
dogukannulu/airflow_kafka_cassandra_mongodb
Produce Kafka messages, consume them, and upload them into Cassandra and MongoDB.
moontucer/Data-Streaming-Project
Uses Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions to establish an end-to-end data pipeline
MarcosMJD/ghcn-d
Data Pipeline from the Global Historical Climatology Network DataSet
Vinodkumar-yerraballi/Pesonal-Projects
aws-samples/monitoring-apache-iceberg-table-metadata-layer
Sample code to collect Apache Iceberg metrics for table monitoring
duyet/shopee-track-demo
Demonstrates how to schedule GitHub Workflows to run scripts that monitor product availability on shopee.com
ANelson82/de_zoomcamp_2022_earthquake_capstone
weslleylc/Feature-Store
A containerized approach using Apache Kafka, Spark, Cassandra, Hive, Jupyter, and Docker-compose.
taboularasa/learn_sql_the_hard_way
fiatttkk/clarissa_project
vdn-projects/music-weblog-streaming-data-pipeline