zxy-zxy's Stars
meltano/meltano
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
DataTalksClub/llm-zoomcamp
LLM Zoomcamp - a free online course about building a Q&A system
DataTalksClub/mlops-zoomcamp
Free MLOps course from DataTalks.Club
hyunjun/bookmarks
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
kristeligt-dagblad/dbt_ml
Package for dbt that allows users to train, audit and use BigQuery ML models.
edx/snowflake_timetravel_table
A table-type dbt materialization for Snowflake to enable Time Travel
erika-e/dbt-tips
Collection of dbt Tips and Tricks
MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
ebhy/budgetml
Deploy a ML inference service on a budget in less than 10 lines of code.
palantir/pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
linkedin/iris
Iris is a highly configurable and flexible service for paging and messaging.
arjones/bigdata-workshop-es
Workshop Big Data en Español
spbail/dag-stack
Data pipeline with dbt, Airflow, Great Expectations
astronomer/airflow-dbt-demo
A repository of sample code to accompany our blog post on Airflow and dbt.
calogica/dbt-expectations
Port(ish) of Great Expectations to dbt test macros
aphyr/distsys-class
Class materials for a distributed systems lecture series
astronomer/dag-factory
Dynamically generate Apache Airflow DAGs from YAML configuration files
etsy/boundary-layer
Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform
soggycactus/airflow-repo-template
The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins.
jghoman/awesome-apache-airflow
Curated list of resources about Apache Airflow
teamclairvoyant/airflow-maintenance-dags
A series of DAGs/Workflows to help maintain the operation of Airflow
deordie/deordie-digest
Data Engineering Digest
jsonsystems/json-schema
JSONSchema.Net Public Repository
slgero/testovoe
Home assignments for data science positions
oxnr/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
sungchun12/airflow-toolkit
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) :desktop_computer: >> [ :rocket:, :ship: ]
davidfoerster/aptsources-cleanup
Detects and interactively deactivates duplicate Apt source entries and deletes sources list files without valid enabled source entries (as requested in https://askubuntu.com/a/762815/175814).
puckel/docker-airflow
Docker Apache Airflow