Pinned Repositories
databricks-sql-clone
Python script to clone a SQL dashboard from one workspace to another
dbx
CLI tool for advanced Databricks jobs management.
geoscan
Geospatial clustering at massive scale
migrate
Scripts to help customers with one-off migrations between Databricks workspaces.
mleap
MLeap: Deploy Spark Pipelines to Production
tempo
This project provides an API for manipulating time series on top of Apache Spark. Functionality includes featurization using lagged time values, rolling statistics (mean, sum, count, etc.), AS OF joins, and downsampling and interpolation. It has been tested on terabyte-scale historical data and is covered by unit tests.
security-analysis-tool
Security Analysis Tool (SAT) analyzes a customer's Databricks account and workspace security configurations and provides recommendations that help them follow Databricks' security best practices. When a customer runs SAT, it compares their workspace configurations against a set of security best practices and delivers a report.
terraform-databricks-sra
The Security Reference Architecture (SRA) implements, as Terraform templates, the typical security features deployed by most high-security organizations, and enforces controls for the largest risks that customers ask about most often.
splunk-integration
Databricks Add-on for Splunk
arunpamulapati's Repositories
arunpamulapati/databricks-sql-clone
Python script to clone a SQL dashboard from one workspace to another
arunpamulapati/dbx
CLI tool for advanced Databricks jobs management.
arunpamulapati/geoscan
Geospatial clustering at massive scale
arunpamulapati/migrate
Scripts to help customers with one-off migrations between Databricks workspaces.
arunpamulapati/mleap
MLeap: Deploy Spark Pipelines to Production
arunpamulapati/tempo
This project provides an API for manipulating time series on top of Apache Spark. Functionality includes featurization using lagged time values, rolling statistics (mean, sum, count, etc.), AS OF joins, and downsampling and interpolation. It has been tested on terabyte-scale historical data and is covered by unit tests.
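For readers unfamiliar with the term, an AS OF join pairs each row of one time series with the most recent row of another series at or before its timestamp. The following is a minimal plain-Python sketch of that idea only. It does not use tempo or Spark, and the function name `as_of_join` and the sample `trades` and `quotes` data are hypothetical, chosen purely for illustration.

```python
from bisect import bisect_right


def as_of_join(left, right):
    """Pair each (ts, value) in `left` with the latest `right` value
    whose timestamp is <= ts, or None if no such row exists."""
    # Sort both sides by timestamp so binary search is valid.
    right_sorted = sorted(right, key=lambda row: row[0])
    right_ts = [ts for ts, _ in right_sorted]

    joined = []
    for ts, value in sorted(left, key=lambda row: row[0]):
        # Index of the last right-hand row at or before `ts`.
        idx = bisect_right(right_ts, ts) - 1
        match = right_sorted[idx][1] if idx >= 0 else None
        joined.append((ts, value, match))
    return joined


# Hypothetical sample data: (timestamp, payload) pairs.
trades = [(3, "buy"), (7, "sell"), (1, "buy")]
quotes = [(2, 100.0), (5, 101.5), (6, 102.0)]

result = as_of_join(trades, quotes)
for row in result:
    print(row)
```

The trade at timestamp 1 has no earlier quote, so it is paired with `None`. A distributed implementation on Spark would need a different strategy, since this sketch assumes both series fit in memory.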