akalautaro's Stars
awesome-selfhosted/awesome-selfhosted
A list of Free Software network services and web applications which can be hosted on your own servers
conanbatt/interview-practice
A repo for interview practice.
Raphire/Win11Debloat
A simple, easy-to-use PowerShell script to remove bloatware apps from Windows, disable telemetry and Bing in Windows search, and perform various other changes to declutter and improve your Windows experience. This script works for both Windows 10 and Windows 11.
astral-sh/rye
A Hassle-Free Python Experience
josephmachado/data_engineering_best_practices
Sample project to demonstrate data engineering best practices
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
data-mie/dbt-profiler
Macros for generating dbt model data profiles
DataRecce/recce
Data Reconnaissance - pull request review tool for dbt projects
vladkens/twscrape
2024! X / Twitter API scraper with authorization support. Allows you to scrape search results, user profiles (followers/following), Tweets (favoriters/retweeters) and more.
dostonnabotov/introduction-to-algorithms
📚 Introduction to Algorithms
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
garjita63/de-zoomcamp-2024
mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
natayadev/dataengineering-roadmap
Yet another repository with basic concepts, technical challenges, and resources on data engineering in Spanish 🧙✨
10Kang/DE_Zoomcamp2024_ZY
Repository for Data Engineering Zoomcamp 2024
iobruno/data-engineering-zoomcamp
Data Engineering examples covering Airflow and Mage for workflows; dbt for BigQuery, Redshift, ClickHouse; Spark and Kafka for Batch/Streaming Processing
spark-examples/pyspark-examples
PySpark RDD, DataFrame and Dataset examples in Python
AlexIoannides/pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
codin-eric/hooks_base
gtoonstra/etl-with-airflow
ETL best practices with Airflow, with examples
blindma1den/Programming-Skills-Level1
TrivadisPF/platys-modern-data-platform
Support for generating modern platforms dynamically with services such as Kafka, Spark, StreamSets, HDFS, ...
andkret/Cookbook
The Data Engineering Cookbook
tuanavu/airflow-tutorial
Apache Airflow tutorial
PiConsulting/Pensadero
This is where we put useful code for our daily job with data.
eric-czech/data-engineer-challenge
NBS Recruiting challenge given to prospective data engineers
archie-cm/IBM-Data-Engineering-Capstone-Project
Business challenge that requires building a data platform for retailer data analytics.
sonarsushant/Sapient-Data-Engineer-Challenge
Created a data pipeline to stream data and generate real-time alerts using NiFi, Kafka and Spark
rcourivaud/data-engineering-coding-challenge