simonoke7's Stars
josephmachado/simple_polars_etl
sdw-online/code_examples_library
The code examples from my online content
josephmachado/data_engineering_best_practices
Sample project to demonstrate data engineering best practices
kazarmax/soda_core_snowflake
Using Soda Core (CLI and Python package) to check data quality of Superstore dataset in Snowflake
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Avaiga/taipy
Turns Data and AI algorithms into production-ready web applications in no time.
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
escobar-west/polars-cookbook
Recipes for using Python's polars library
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
kaxil/airflowctl
A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects
RamiKrispin/vscode-python
A Tutorial for Setting Python Development Environment with VScode and Docker
jpmorganchase/python-training
Python training for business analysts and traders
Kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
microsoft/dbt-fabric
Paulescu/real-time-ohlc-with-bytewax
Fetch, transform and plot real-time OHLC data from Coinbase using Bytewax, Bokeh and Streamlit
alfredosa/airflow-dbt-metabase
vinamrgrover/ETL-DynamoDB-to-S3
Batch ETL Pipeline built on AWS
Modulos/data_copilot
Data Copilot is the framework which makes your chat bot enterprise ready with only few lines of code.
im-nsk/StockMarketScraper-Extracting-Real-Time-Stock-Data-from-Yahoo-Finance
In this web scraping project, my goal is to extract real-time stock market data from the renowned Yahoo Finance website. By leveraging web scraping techniques, I am able to capture up-to-date information on stock prices, volume, market caps, and other key metrics.
cookiecutter/cookiecutter
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
priye-1/airflow_data_pipeline
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
degagawolde/data-warehouse-dbt-airflow-postgress
A data-warehouse built for the pNEUMA open dataset of naturalistic trajectories of half a million vehicles collected by a swarm of drones in a congested downtown area of Athens, Greece.
confluentinc/demo-change-data-capture
This demo shows how to capture data changes from relational databases (Oracle and PostgreSQL) and stream them to Confluent Cloud, use ksqlDB for real-time stream processing, send enriched data to cloud data warehouses (Snowflake and Amazon Redshift).
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
im-nsk/Building-an-Automated-Weather-Data-Pipeline-with-Airflow-From-Ingestion-to-Data-Warehouse
This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeatherAPI and loading it into a data warehouse, specifically AWS Redshift.
sbalnojan/FDE-airflow-tutorial
Functional Data Engineering tutorial in Python & Airflow.
keplergl/kepler.gl
Kepler.gl is a powerful open source geospatial analysis tool for large-scale data sets.
openai/openai-cookbook
Examples and guides for using the OpenAI API
tomatminceddata/learningdeneb