RamonPuon's Stars
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
ritchieng/the-incredible-pytorch
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
aome510/spotify-player
A Spotify player in the terminal with full feature parity
AdminTurnedDevOps/DevOps-The-Hard-Way-AWS
This repository contains free labs for setting up an entire workflow and DevOps environment from a real-world perspective in AWS
gunnarmorling/awesome-opensource-data-engineering
An Awesome List of Open-Source Data Engineering Projects
data-engineering-community/data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
amphi-ai/amphi-etl
Python-based Low-code ETL for data manipulation and transformation. Generates Python code you can deploy anywhere.
fmind/mlops-python-package
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
shafiab/HashtagCashtag
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggregates Twitter and US stock market data for user sentiment analysis using open source tools - Apache Kafka for data ingestions, Apache Spark & Spark Streaming for batch & real-time processing, Apache Cassandra f or storage, Flask, Bootstrap and HighCharts f or frontend.
paxo-phone/PaxOS-8
Code source du système d'exploitation du PaxoPhone
hnawaz007/pythondataanalysis
Python data repo, jupyter notebook, python scripts and data.
DataTalksClub/project-of-the-week
Learn by doing: DIY project groups at DataTalks.Club
Snowflake-Labs/snowpark-python-demos
This repository provides various demos/examples of using Snowpark for Python.
riti2409/Computer-networks
Computer network notes
derar-alhussein/Databricks-Certified-Data-Engineer-Associate
The resources of the preparation course for Databricks Data Engineer Associate certification exam
dagster-io/fake-star-detector
OpenVisualCloud/Smart-City-Sample
The smart city reference pipeline shows how to integrate various media building blocks, with analytics powered by the OpenVINO™ Toolkit, for traffic or stadium sensing, analytics and management tasks.
dogukannulu/kafka_spark_structured_streaming
Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra
haraschax/nograd
Gradient descent is cool and all, but what if we could delete it?
johnny-chivers/pyspark-glue-tutorial
treeverse/lakeFS-samples
lakefs-samples repository
dbt-labs/dbot
An LLM-powered chatbot with the added context of the dbt knowledge base.
mansik95/IMDB-Analysis
This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and franchises.
airscholar/changecapture-e2e
This project shows how to capture changes from postgres database and stream them into kafka
iaashu98/art-gallery-database-management
This project is about Art Gallery Database management system. This is basically consist of management of Users and Gallery database. This project manages orders, shows customer's , artist's, artwork's details.
risingwavelabs/risingwave-data-talks-workshop-2024-03-04
DataTalks Workshop Materials
imkaran45/spotify-end-to-end-aws-snowflake
airscholar/EMR-for-data-engineers
This project demonstrates the use of Amazon Elastic Map Reduce (EMR) for processing large datasets using Apache Spark. It includes a Spark script for ETL (Extract, Transform, Load) operations, AWS command line instructions for setting up and managing the EMR cluster, and a dataset for testing and demonstration purposes.