Pinned Repositories
100DaysOfContainersAndOrchestration
Your go-to open source repo to learn containers (Docker, Podman, etc.) and Orchestration (Kubernetes, ECS, etc.) from start to finish.
advanced-data-engineering-with-databricks
airflow-tutorial
Apache Airflow tutorial
AnalyticsinaBox
FTA Toolkit - Analytics in a Box
apache-spark-programming-with-databricks
api-guidelines
Microsoft REST API Guidelines
architecture-center
Azure Architecture Center
azure-content
Repository containing the Articles on azure.microsoft.com Documentation Center
azure-databricks-dev-guide
azure-rest-api-specs
The source for REST API specifications for Microsoft Azure.
0xbadidea's Repositories
0xbadidea/100DaysOfContainersAndOrchestration
Your go-to open source repo to learn containers (Docker, Podman, etc.) and Orchestration (Kubernetes, ECS, etc.) from start to finish.
0xbadidea/AnalyticsinaBox
FTA Toolkit - Analytics in a Box
0xbadidea/apache-spark-programming-with-databricks
0xbadidea/azure-synapse-analytics-end2end
Azure Analytics End to End with Azure Synapse - Deployment Accelerator
0xbadidea/AzureMasterClass
Repo for the Azure Master Class
0xbadidea/chispa
PySpark test helper methods with beautiful error messages
0xbadidea/comprehensive-rust
This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust to everyone.
0xbadidea/data-diff
Efficiently diff data in or across relational databases
0xbadidea/data-engineering-with-databricks-english
0xbadidea/database-lab-engine
DLE enables 🖖 DB branching and ⚡️ thin cloning for any Postgres database and empowers DB testing in CI/CD. This optimizes database-related costs while improving time-to-market and software quality. Follow to stay updated.
0xbadidea/databricks-observability
OpenTelemetry Demo with Azure Databricks and Azure Monitor
0xbadidea/dbdemos
Demos to implement your Databricks Lakehouse
0xbadidea/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
0xbadidea/dlt-meta
This is metadata driven DLT based framework for bronze/silver pipelines
0xbadidea/FTALive-Sessions
This repository is a public-facing source of information for FastTrack for Azure Live sessions.
0xbadidea/ide-best-practices
Best practices for working with Databricks from an IDE
0xbadidea/joy-of-system-design
An online game to kindle the spark of system design in you.
0xbadidea/learn-databricks
Notebooks to learn Databricks Lakehouse Platform
0xbadidea/modern-data-warehouse-dataops
DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
0xbadidea/notebook-best-practices
An example showing how to apply software engineering best practices to Databricks notebooks.
0xbadidea/public-apis
A collective list of free APIs
0xbadidea/spark-local-execution
0xbadidea/spark-style-guide
Spark style guide
0xbadidea/spark-testing-base
Base classes to use when writing tests with Spark
0xbadidea/splink
Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
0xbadidea/splink_demos
Interactive notebooks containing demonstration code of the splink library
0xbadidea/sqlglot
Python SQL Parser and Transpiler
0xbadidea/talent-plan
open source training courses about distributed database and distributed systems
0xbadidea/the-algorithm
Source code for Twitter's Recommendation Algorithm
0xbadidea/the-algorithm-ml
Source code for Twitter's Recommendation Algorithm