/dbconnect-plotly

dbconnect demo with Plotly

Primary LanguagePythonOtherNOASSERTION

Databricks Connect ❤️ Data Applications

This is a sample app on how to show how easy it is to build a new application using Databricks Connect and Plotly.

From DBR 13 onwards, Databricks Connect is now built on open-source Spark Connect. Spark Connect introduces a decoupled client-server architecture for Apache Spark™ that allows remote connectivity to Spark clusters using the DataFrame API and unresolved logical plans as the protocol. With this new architecture based on Spark Connect, Databricks Connect becomes a thin client that is simple and easy to use! It can be embedded everywhere to connect to Databricks: in IDEs, Notebooks and any application, allowing customers and partners alike to build new (interactive) user experiences based on their Databricks Lakehouse!

All you need to get started is a Databricks cluster and this simple Python application. The dataset used in this application is the standard Databricks samples dataset.

To get started, create a new virtual environment and install the reuired dependencies

pip install -r requirements.txt

To run the application, make sure that you have created a profile for your workspace in the ~/.databrickscfg according to the documentation and crearted a cluster with at least Databricks Runtime 13.0 and above.

In the app.py file configure the values for PROFILE, CLUSTER_ID accordingly.

from databricks.connect.session import DatabricksSession as SparkSession
from databricks.sdk.core import Config

config = Config(profile="PROFILE", cluster_id="CLUSTER_ID")
spark = SparkSession.builder.sdkConfig(config).getOrCreate()

Now, run the plotly app

python app.py

Screenshot

Dependencies

This sample application is meant for illustration purposes only. The application uses the follwing third-party dependencies: