Pinned Repositories
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
data-engineering-capstone
Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development
dl-projects
Deep Learning with Pytorch & fastai, 2018
gitlab-analytics
Fork of GitLab's Analytics Repo (DBT+Airflow)
missing-semester
Notes on The Missing Semester of Your CS Education (https://missing.csail.mit.edu/)
udacity-dend
Data Engineering Nanodegree
data-diff
Compare tables within or across databases
dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
audit-ai
detect demographic differences in the output of machine learning models or other assessments
danieldiamond's Repositories
danieldiamond/gitlab-analytics
Fork of GitLab's Analytics Repo (DBT+Airflow)
danieldiamond/missing-semester
Notes on The Missing Semester of Your CS Education (https://missing.csail.mit.edu/)
danieldiamond/embankment
Using Laplace and 2nd order partial differential equations in python to solve engineering design problems.
danieldiamond/airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
danieldiamond/airbyte-cli
This is the next generation Go based Airbyte CLI.
danieldiamond/airbyte-connectors
Airbyte connectors (sources & destinations) + Airbyte CDK for JavaScript/TypeScript
danieldiamond/airbyte-helm-chart
Hosted airbyte helm chart
danieldiamond/dagster
A Python library for building data applications: ETL, ML, Data Pipelines, and more.
danieldiamond/danieldiamond.github.io
🌐 Personal Webpage https://danieldiamond.github.io
danieldiamond/data-diff
Efficiently diff data in or across relational databases
danieldiamond/dbt-cloud-api-postman
danieldiamond/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
danieldiamond/dbt-duckdb
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
danieldiamond/dbt-event-logging
a dbt package to make auditing dbt runs easy.
danieldiamond/dbt-streamlit
danieldiamond/dbt-sugar
dbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
danieldiamond/dbt-utils
Utility functions for dbt projects.
danieldiamond/docs.getdbt.com
The code behind docs.getdbt.com
danieldiamond/esky
A Flask Server to run and store notebooks asynchronously using huey workers.
danieldiamond/fastai2
Temporary home for fastai v2 while it's being developed
danieldiamond/hanukkahofdata
https://hanukkah.bluebird.sh/
danieldiamond/kubetest
Kubernetes integration tests in Python
danieldiamond/looker_deployer
A tool to help deploy objects from one Looker instance to another
danieldiamond/mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
danieldiamond/marqo
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
danieldiamond/py-marqo
Python client for Marqo
danieldiamond/python-metaclasses
A deep dive into python metaclasses and best practices
danieldiamond/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
danieldiamond/tutorial
danieldiamond/vscode-dbt-power-user
This extension makes vscode seamlessly work with dbt.