Databricks
Helping data teams solve the world’s toughest problems using data and AI
United States of America
Pinned Repositories
click
The "Command Line Interactive Controller for Kubernetes"
dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
jsonnet-style-guide
Databricks Jsonnet Coding Style Guide
koalas
Koalas: pandas API on Apache Spark
learning-spark
Example code from Learning Spark book
LearningSparkV2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
megablocks
scala-style-guide
Databricks Scala Coding Style Guide
spark-deep-learning
Deep Learning Pipelines for Apache Spark
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
Databricks's Repositories
databricks/jsonnet-style-guide
Databricks Jsonnet Coding Style Guide
databricks/spark-integration-tests
Integration tests for Spark
databricks/spark-pr-dashboard
Dashboard to aid in Spark pull request reviews
databricks/ide-best-practices
Best practices for working with Databricks from an IDE
databricks/diviner
Grouped time series forecasting engine
databricks/drunken-data-quality-1
Spark package for checking data quality
databricks/spark-package-cmd-tool
A command line tool for Spark packages
databricks/upload-dbfs-temp
databricks/simple-pipeline
Example pipeline for bit.io
databricks/xgboost-linux64
Databricks Private xgboost Linux64 fork
databricks/jenkins-job-builder
Fork of https://docs.openstack.org/infra/jenkins-job-builder/ to include unmerged patches
databricks/weld
High-performance runtime for data analytics applications
databricks/incubator-airflow
Apache Airflow (Incubating)
databricks/blobfuse-fork-public
A virtual file system adapter for Azure Blob storage
databricks/jackson-module-scala
Add-on module for Jackson (https://github.com/FasterXML/jackson) to support Scala-specific datatypes
databricks/jenkins-job-builder-addons
Addons for jenkins job builder
databricks/json-bigint
JSON.parse/stringify with bigints support
databricks/nailgun
Nailgun is a client, protocol, and server for running Java programs from the command line without incurring the JVM startup overhead.
databricks/pyodbc
Python ODBC bridge
databricks/SnpEff
Databricks snpeff fork
databricks/examples
Examples of using bit.io with various programming languages
databricks/azure-relay-node
☁️Node.js library for Azure Relay Hybrid Connections
databricks/bazel-toolchain
LLVM toolchain for bazel
databricks/ec2-plugin
Jenkins ec2 plugin
databricks/jarjar
Jar Jar Links is a utility that makes it easy to repackage Java libraries and embed them into your own distribution.
databricks/jetty.project
Eclipse Jetty® - Web Container & Clients - supports HTTP/2, HTTP/1.1, HTTP/1.0, websocket, servlets, and more
databricks/Lenses
Tiny lenses library with focus on ease of use.
databricks/pileup.js
Interactive in-browser track viewer
databricks/python-apt-mirror-updater
Automated, robust apt-get mirror selection for Debian and Ubuntu
databricks/rules_proto
Modern bazel build rules for protobuf / gRPC