BrianWW's Stars
public-apis/public-apis
A collective list of free APIs
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
dbeaver/dbeaver
Free universal database tool and SQL client
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
yeasy/docker_practice
Learn and understand Docker&Container technologies, with real DevOps practice!
spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
apache/arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
pawl/awesome-etl
A curated list of awesome ETL frameworks, libraries, and software.
datafold/data-diff
Compare tables within or across databases
re-data/re-data
re_data - fix data issues before your users & CEO would discover them 😊
Multiwoven/multiwoven
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
pyenv/pyenv-virtualenvwrapper
an alternative approach to manage virtualenvs from pyenv.
Titan-Systems/titan
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.
pusher/pusher-http-python
Pusher Channels HTTP API library for Python
opendistro-for-elasticsearch/sample-code
👋 Welcome to the Open Distro sample-code area. Share your great ideas and code samples with the Open Distro Community.
tomasfarias/airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
dbt-labs/docs.getdbt.com
The code behind docs.getdbt.com
hellonarrativ/spectrify
Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.
hudl/luigi-monitor
Send summary messages of your Luigi jobs to Slack
openraven/mockingbird
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
rasgointelligence/RasgoTransforms
SQL-based transforms compatible with Rasgo and PyRasgo
dariocazas/howto-debezium-to-snowflake
Using Debezium to capture data changes from databases and populate these as historic evolution and table replication in Snowflake