deltalake
There are 55 repositories under deltalake topic.
databricks_delta_table_samples
This is a code sample repository for demonstrating how to perform Databricks Delta Table operations.
automatic-happiness
A demo repository for integrating a 3rd party data source (e.g. a data platform exposing its data via APIs) to Apache Superset via Deltalake
EMR_Studio_Delta_Lake
Deltalake examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
ifood-data
Ifood data wrangling with Apache Airflow and Apache Spark running on Kubernetes
delta-lake-dms-cdc
Example application for DMS CDC with Delta Lake and Apache Hudi
treinamento-dataproc-deltalake
Ambiente de treinamento para Dataproc e DeltaLake
wideworldadventure
This repository includes all files that compose the design and unification of the databases AdventureWorks and WideWorldAdventure project.
rust_nextstep
A series of exercises to play with more advanced topics in Rust
glue-docker-image
A custom Glue Docker image
Deltalake
Projeto de engenharia de dados para obtenção de dados, desenvolvimento de um deltalake com o python e análises com o Apache Spark
flight-ml-preprocess-gcp
Continuous flight event data processing using Spark Streaming, Delta Lake storage, deployed on GCP dataproc cluster.
Formula1
Formula1 ADF pipeline
dataops
Small data pipeline with airflow scheduling
lambda-delta-optimize
AWS Lambda function for optimizing Delta tables
taxacco
Проект № 4 для курса "Инженер данных".
Databricks-AWS
Databricks provides a unified, open platform for all your data. It empowers data scientists, data engineers and data analysts with a simple collaborative environment to run interactive and scheduled data analysis workloads.
datastack-playground
A datastack playground; includes Spark, Kafka, Airbyte, etc.
OpenTableFormat.github.io
Website for open table format 🕸
Data-Scientist-learning-path-using-databricks
This is the summary of learning Data Science using Databricks