DoddiC's Stars
dmvaldman/library
📚 Papers and essays I find timeless
kubeshop/botkube
An app that helps you monitor your Kubernetes cluster, debug critical deployments, and get recommendations for standard practices
pditommaso/awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
HariSekhon/DevOps-Bash-tools
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
AkashRajpurohit/howtoprofessionallysay
📖 A guide for your daily "professional" interactions
mskadu/power-shell-scripts
A set of PowerShell scripts I have developed/improved
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
apache/datafusion-python
Apache DataFusion Python Bindings
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
kaxil/airflowctl
A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects
apache/datafusion
Apache DataFusion SQL Query Engine
dask/dask
Parallel computing with task scheduling
apache/iceberg
Apache Iceberg
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
apache/druid
Apache Druid: a high performance real-time analytics database.
apache/flink
Apache Flink
dataform-co/dataform
Dataform is a framework for managing SQL based data operations in BigQuery
spotify/scio
A Scala API for Apache Beam and Google Cloud Dataflow.
great-expectations/great_expectations
Always know what to expect from your data.
adilkhash/Data-Engineering-HowTo
A list of useful resources to learn Data Engineering from scratch
andkret/Cookbook
The Data Engineering Cookbook
gunnarmorling/awesome-opensource-data-engineering
An Awesome List of Open-Source Data Engineering Projects
awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
gtoonstra/etl-with-airflow
ETL best practices with Airflow, with examples
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
datastacktv/data-engineer-roadmap
Roadmap to becoming a data engineer in 2021
ploomber/ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
apache/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
igorbarinov/awesome-data-engineering
A curated list of data engineering tools for software developers