Databricks
Helping data teams solve the world’s toughest problems using data and AI
United States of America
Pinned Repositories
click
The "Command Line Interactive Controller for Kubernetes"
dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
jsonnet-style-guide
Databricks Jsonnet Coding Style Guide
koalas
Koalas: pandas API on Apache Spark
learning-spark
Example code from Learning Spark book
LearningSparkV2
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
megablocks
scala-style-guide
Databricks Scala Coding Style Guide
spark-deep-learning
Deep Learning Pipelines for Apache Spark
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
Databricks's Repositories
databricks/libpg_query_old
C library for accessing the PostgreSQL parser outside of the server environment
databricks/social-app-django
Python Social Auth - Application - Django
databricks/django-allauth
Integrated set of Django applications addressing authentication, registration, account management as well as 3rd party (social) account authentication.
databricks/devbox
databricks/pgpool2
This is the official mirror of git://git.postgresql.org/git/pgpool2.git. Note that this is just a *mirror* - we don't work with issues/pull requests on github. Please visit our web site to file bug reports or submit patches.
databricks/dagster
A data orchestrator for machine learning, analytics, and ETL.
databricks/spark-knowledgebase
Spark Knowledge Base
databricks/Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
databricks/test-infra
Test infrastructure for the Kubernetes project.
databricks/subpar
Subpar is a utility for creating self-contained Python executables. It is designed to work well with Bazel.
databricks/brew-install-specific
Find and install specific versions of brew packages
databricks/covid
Documentation on bit.io's Covid Explorer
databricks/benchmarks
A place in which we publish scripts for reproducible benchmarks.
databricks/databricks-accelerators
Accelerate the use of Databricks for customers [public repo]
databricks/spark-sklearn
(Deprecated) Scikit-learn integration package for Apache Spark
databricks/mlflow
Open source platform for the machine learning lifecycle
databricks/spark-sklearn-docs
databricks/mlflow-example-sklearn-elasticnet-wine
databricks/spark-avro
Avro Data Source for Apache Spark
databricks/spark-csv
CSV Data Source for Apache Spark 1.x
databricks/spark-corenlp
Stanford CoreNLP wrapper for Apache Spark
databricks/xgb-regressor
MLflow XGBoost Regressor
databricks/sjsonet-old
databricks/lsbrepo
databricks/build-tooling
Databricks Education department's curriculum build tool chain
databricks/azure-databricks-demos
databricks/spark-perf
Performance tests for Apache Spark
databricks/sbt-databricks
An sbt plugin for deploying code to Databricks Cloud
databricks/spark-salesforce
Spark data source for Salesforce
databricks/knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.