PrimWILL's Stars
kubernetes/kubernetes
Production-Grade Container Scheduling and Management
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
ByteByteGoHq/system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
python/cpython
The Python programming language
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
openai/openai-cookbook
Examples and guides for using the OpenAI API
charlax/professional-programming
A collection of learning resources for curious software engineers
faif/python-patterns
A collection of design patterns/idioms in Python
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
open-guides/og-aws
📙 Amazon Web Services — a practical guide
karanpratapsingh/system-design
Learn how to design systems at scale and prepare for system design interviews
apache/kafka
Mirror of Apache Kafka
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
vectordotdev/vector
A high-performance observability data pipeline.
apache/hadoop
Apache Hadoop
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
datahub-project/datahub
The Metadata Platform for your Data Stack
igorbarinov/awesome-data-engineering
A curated list of data engineering tools for software developers
japila-books/apache-spark-internals
The Internals of Apache Spark
data-engineering-community/data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
conduktor/kafka-beginners-course
google/cluster-data
Borg cluster traces from Google
zzsza/Datascience-Interview-Questions
Datascience-Interview-Questions for Korean
alanchn31/Data-Engineering-Projects
Personal Data Engineering Projects
opendatadiscovery/awesome-data-catalogs
📙 Awesome Data Catalogs and Observability Platforms.
josephmachado/data_engineering_project_template
A template repository to create a data project with IAC, CI/CD, Data migrations, & testing
dhkdn9192/data_engineer_career
DE직무에 필요한 모든 것
wanteddev/wanted_jira_bolt
슬랙 스레드에 이모지를 달면 스레드 전체를 요약하여 Jira 티켓을 생성합니다!
1ambda/terraform-aws-eks-jupyterhub
Running JupyterHub on Kubernetes (AWS EKS) in 30 minutes :fire: