A repo for getting familiar with Databricks.
Place the following in your GitHub Codespaces secrets:
- DATABRICKS_HOST
- DATABRICKS_TOKEN
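Codespaces secrets are exposed to the container as environment variables, so a quick check before running any CLI command can save some confusion. The following is a minimal sketch; the function name `missing_settings` is hypothetical and not part of this repo.

```python
import os

# The two secrets this README asks you to configure.
REQUIRED_SETTINGS = ("DATABRICKS_HOST", "DATABRICKS_TOKEN")


def missing_settings(env=None):
    """Return the names of required settings that are unset or empty.

    `env` defaults to os.environ; passing a plain dict makes this easy to test.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_SETTINGS if not env.get(name)]
```

Calling `missing_settings()` in a Codespace terminal session should return an empty list once both secrets are configured and the Codespace has been restarted.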
databricks clusters list --output JSON | jq
databricks fs ls dbfs:/
databricks jobs list --output JSON | jq
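The same listings can probably be fetched from Python through the Databricks REST API. The sketch below uses only the standard library. It assumes that DATABRICKS_HOST includes the `https://` scheme and that the token is a personal access token sent as a Bearer header. The helper names `build_request` and `get_json` are hypothetical, and the endpoint paths in the comments are my assumption of what the CLI commands above correspond to.

```python
import json
import os
import urllib.request


def build_request(host, token, path):
    """Build an authenticated GET request for a Databricks REST API path."""
    url = host.rstrip("/") + path
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})


def get_json(path):
    """Call the workspace named by the Codespaces secrets and decode the JSON reply."""
    req = build_request(
        os.environ["DATABRICKS_HOST"],
        os.environ["DATABRICKS_TOKEN"],
        path,
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


# Example calls (need network access and valid secrets, so left commented out):
# clusters = get_json("/api/2.0/clusters/list")
# jobs = get_json("/api/2.1/jobs/list")
```

Keeping request construction separate from the network call makes the URL and header logic checkable without touching a real workspace.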
For remote SQL access, see the JDBC/ODBC connection details for a cluster: https://docs.databricks.com/integrations/bi/jdbc-odbc-bi.html#connection-details-cluster
Alternatives to Databricks include https://tutorial.dask.org/00_overview.html[Dask] and Ray.
- Download data from Kaggle and upload it by right-clicking in the Explorer pane in GitHub Codespaces.
- Place it in a "datasets" directory and add this directory to your .gitignore. This ensures you don't check in a 1GB file to GitHub.
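The .gitignore step above can be scripted so it is safe to run more than once. This is a minimal sketch; the function name `ensure_ignored` is hypothetical, and the entry `datasets/` assumes the directory sits at the repo root.

```python
from pathlib import Path


def ensure_ignored(gitignore_path, entry="datasets/"):
    """Append `entry` to the .gitignore file unless it is already listed.

    Returns True if the file was changed, False if the entry was already present.
    """
    path = Path(gitignore_path)
    lines = path.read_text().splitlines() if path.exists() else []
    if entry in (line.strip() for line in lines):
        return False
    lines.append(entry)
    path.write_text("\n".join(lines) + "\n")
    return True
```

Running `ensure_ignored(".gitignore")` from the repo root before uploading the Kaggle files keeps the large dataset out of `git status`.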
Enable enron...
streamlit hello --server.enableCORS=false
streamlit run hello_streamlit_enron.py --server.enableCORS=false