databricks
There are 743 repositories under the databricks topic.
getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Tencent/APIJSON
🏆 A zero-code, full-featured, security-hardened ORM library 🚀 A JSON Transmission Protocol and ORM library that provides back-end APIs and docs with zero code, while the front end (client) customizes the data and structure of the returned JSON.
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
tobymao/sqlglot
Python SQL Parser and Transpiler
microsoft/SynapseML
Simple and Distributed Machine Learning
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
dotnet/spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
delta-io/delta-rs
A native Rust library for Delta Lake, with bindings into Python
hystax/optscale
FinOps and MLOps platform to run ML/AI and regular cloud workloads with optimal performance and cost.
Multiwoven/multiwoven
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack.
Azure-Samples/modern-data-warehouse-dataops
DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
mlcraft-io/mlcraft
Synmetrix – an open source semantic layer to boost your LLM precision
databrickslabs/dbx
🧱 Databricks CLI eXtensions – aka dbx, a CLI tool for development and advanced Databricks workflow management.
databricks/terraform-provider-databricks
Databricks Terraform Provider
thoughtworks/mlops-platforms
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
databricks/databricks-sdk-py
Databricks SDK for Python (Beta)
databricks/mlops-stacks
This repo provides a customizable stack for starting new ML projects on Databricks that follows production best practices out of the box.
databrickslabs/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) can generate large simulated / synthetic data sets for testing, POCs, and other uses in Databricks environments, including Delta Live Tables pipelines.
microsoft/nutter
Testing framework for Databricks notebooks
Azure/azure-event-hubs-spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
databrickslabs/overwatch
Capture deep metrics on one or all assets within a Databricks workspace
databrickslabs/cicd-templates
Manage your Databricks deployments and CI with code.
Azure/azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
adidas/lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
databricks/dbt-databricks
A dbt adapter for Databricks.
CartoDB/analytics-toolbox-core
A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Postgres and Databricks with Spatial Analytics capabilities
databricks/terraform-databricks-examples
Examples of using Terraform to deploy Databricks resources
lamastex/scalable-data-science
Scalable Data Science: course sets in big data using Apache Spark on Databricks, and their mathematical, statistical and computational foundations using SageMath.
databrickslabs/ucx
Your best companion for upgrading to Unity Catalog. UCX will guide you, the Databricks customer, through the process of upgrading your account, groups, workspaces, jobs etc. to Unity Catalog.
aloneguid/stowage
Bloat-free, no BS cloud storage SDK.
aehrc/VariantSpark
Machine learning for genomic variants
databricks/databricks-sql-python
Databricks SQL Connector for Python
dataflint/spark
Performance Observability for Apache Spark
databricks/cli
Databricks CLI
DataThirstLtd/azure.databricks.cicd.tools
Tools for Deploying Databricks Solutions in Azure
martandsingh/ApacheSpark
This repository will help you learn Databricks concepts through examples. It covers the important topics a data engineer needs in real-life work, using PySpark and Spark SQL for development. The course ends with a few case studies.