adnanalvee's Stars
Azure/azure-quickstart-templates
Azure Quickstart Templates
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
davidADSP/Generative_Deep_Learning_2nd_Edition
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
luisguiserrano/manning
Repository for the book Grokking Machine Learning, by Manning Editors
capitalone/datacompy
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
databricks/terraform-provider-databricks
Databricks Terraform Provider
awslabs/amazon-kinesis-producer
Amazon Kinesis Producer Library
databrickslabs/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
databrickslabs/ucx
Automated migrations to Unity Catalog
databrickslabs/overwatch
Capture deep metrics on one or all assets within a Databricks workspace
awslabs/amazon-kinesis-data-generator
A UI that simplifies testing with Amazon Kinesis Streams and Firehose. Create and save record templates, and easily send data to Amazon Kinesis.
databrickslabs/migrate
Old scripts for one-off ST-to-E2 migrations. Use "terraform exporter" linked in the readme.
BenWilson2/ML-Engineering
Reference code base for ML Engineering, Manning Publications
aws-samples/amazon-kinesis-analytics-streaming-etl
Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics
garystafford/kafka-connect-msk-demo
For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR
databrickslabs/databricks-sync
An experimental tool to synchronize source Databricks deployment with a target Databricks deployment.
LearningJournal/Spark-Streaming-In-Scala
Apache Spark 3 - Structured Streaming Course Material
AbePabbathi/lakehouse-tacklebox
This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.
alexott/dlt-files-in-repos-demo
Demonstration of using Files in Repos with Databricks Delta Live Tables
MrPowers/beavis
Pandas helper functions
andyweaves/databricks-audit-logs
aws-samples/amazon-kinesis-data-analytics-for-pyflink-applications
Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.
hwang-db/tf_aws_deployment
Terraform patterns for AWS deployments and Databricks on AWS
alexott/anomaly_detection_using_databricks
PacktPublishing/Real-time-stream-processing-using-Apache-Spark-3-for-Python-developers
Apache Spark 3 - Structured Streaming Course Material
aws-samples/aws-kcl-java
yokawasa/kinesis-consumer
Sample KCL 2.X consumer for AWS Kinesis streams. The consumer is configurable via environment variables and can be containerized (a Dockerfile is provided) so that it runs anywhere.
adnanalvee/training-kit
Open source courseware for Git and GitHub
NashTech-Labs/aws-kinesis-consumer
stephenoffer/databricks_terraform_hub_spoke
Hub and Spoke Template for Databricks Terraform Deployment