lucastancredi's Stars
databricks/delta-live-tables-notebooks
MrPowers/quinn
pyspark methods to enhance developer productivity 📣 👯 🎉
MrPowers/chispa
PySpark test helper methods with beautiful error messages
awslabs/python-deequ
Python API for Deequ
GersonRS/hands-on-development-environment-with-kubernetes
Hands on | Criação de um Ambiente de Desenvolvimento para Engenharia de Dados com Kubernetes
ByteByteGoHq/system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
rogeriomm/labtools-k8s
Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,Airflow, Kafka Strimzi, Datahub, OpenMetadata,Zeppelin, Jupyter, JFrog Container Registry
ifood/ifood-data-engineering-test
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
aureliowozhiak/curso-engenharia-de-dados
git-tips/tips
Most commonly used git tips and tricks.
JohnMiner3/community-work
Presentations given to the Data Platform community.
adidas/lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
lvgalvao/DataProjectStarterKit
Estrutura completa para iniciar um projeto de dados com Python, abrangendo ambiente, git, desenvolvimento, testes e documentação.
TeoMeWhy/data-4u
google/styleguide
Style guides for Google-originated open-source projects
minrk/findspark
palantir/pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
TeoMeWhy/teomerefs
Guia de referências técnicas para carreira em dados
reddelexc/hackerone-reports
Top disclosed reports from HackerOne
pedrogusmao/Migracao
Migração Azure SQL via Databricks
TeoMeWhy/introducao-programacao-python
Curso de Introdução a Programação com Python realizado entre Instituto Aaron Swartz e Téo Me Why
cartershanklin/pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
razevedo1994/data_engineer_roadmap
Personal roadmap to guide my studies.
TheAlgorithms/Python
All Algorithms implemented in Python
san089/goodreads_etl_pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
alinebastos/free-courses
Free IT courses
TeoMeWhy/olist-ml-models
Projeto de Machine Learning do início ao fim no contexto de um e-commerce
andresionek91/CorreiosPrecoPrazo
Correios Preços e Prazos - Python Wrapper
egonSchiele/grokking_algorithms
Code for the book Grokking Algorithms (https://amzn.to/29rVyHf)