vaosinbi's Stars
kelseyhightower/kubernetes-the-hard-way
Bootstrap Kubernetes the hard way. No scripts.
DataTalksClub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
ageron/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
getsops/sops
Simple and flexible tool for managing secrets
GoogleCloudPlatform/terraformer
CLI tool to generate terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code
awslabs/git-secrets
Prevents you from committing secrets and credentials into git repositories
sqlfluff/sqlfluff
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
stanfordnlp/GloVe
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
TobikoData/sqlmesh
Efficient data transformation and modeling framework that is backwards compatible with dbt.
Shopify/ejson
EJSON is a small library to manage encrypted secrets using asymmetric encryption.
astronomer/dag-factory
Dynamically generate Apache Airflow DAGs from YAML configuration files
google/eng-edu
databricks/devrel
This repository contains the notebooks and presentations we use for our Databricks Tech Talks
PacktPublishing/Data-Engineering-with-Python
Data Engineering with Python, published by Packt
GoogleCloudPlatform/bigquery-oreilly-book
Source code accompanying: BigQuery: The Definitive Guide by Lakshmanan & Tigani to be published by O'Reilly Media
microsoft/nutter
Testing framework for Databricks notebooks
terraform-google-modules/terraform-google-bigquery
Creates opinionated BigQuery datasets and tables
aws-samples/data-engineering-for-aws-immersion-day
Lab Instructions for Data Engineering Immersion Day
stelligent/stelligent-u
Templates and code for Stelligent U lessons
SnowflakeDefinitiveGuide/1st-Edition
Source Code Collection and Supplemental Material for the O'Reilly Snowflake Definitive Guide 1st Edition book
mozilla/gcp-ingestion
Documentation and implementation of telemetry ingestion on Google Cloud Platform
oryanmoshe/debezium-timestamp-converter
aws-samples/redshift-etl-automation-with-dbt
fivetran/snowflake_fivetran_vhol
Sample dbt project for the Snowflake + Fivetran Virtual Hands-On Lab
RedHatInsights/expandjsonsmt
Kafka Connect SMT to expand JSON field
GoogleCloudPlatform/bq-mirroring-cdc
jasonsmithio/serverless-eventing
Jay Smith's Serverless Eventing Journey
iht/python-profiling-beam-summit-2021
This repository contains a streaming Dataflow pipeline written in Python with Apache Beam, reading data from PubSub.
92twinturboz/kafka-admin-client-example
Quick and dirty example of how to use the kafka admin client with CCloud to create and modify a topic config.