databricks/databricks-vscode

[Improvement] Warn about (large) workspace synchronization

othonc-db opened this issue · 0 comments

Describe the improvement

When I used the VS Code extension for the first time, I had a very large workspace (this repo) open in VS Code. I only wanted to run one example from my Spark studies, but the result was that my entire workspace (~400 MB) was synchronized to Databricks.

As a new user, I would expect to be warned about this beforehand and given the option to either (a) configure what gets synced or, at least, (b) narrow down my workspace.

This also raises security concerns, since all of my local files are uploaded to Databricks. What if the workspace contains sensitive files such as access keys or PII?
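
To illustrate the kind of pre-sync check I have in mind, here is a minimal Python sketch that reports the total size of a folder and flags file names that look sensitive. It is not part of the extension and does not use any extension API. The function name `scan_workspace`, the `skip_dirs` default, and the pattern list are all assumptions made for this example, and the patterns are far from exhaustive.

```python
import fnmatch
import os

# Hypothetical name patterns for files that may hold secrets; adjust per project.
SENSITIVE_PATTERNS = ["*.pem", "*.key", ".env", "*credentials*", "id_rsa*"]


def scan_workspace(root, skip_dirs=(".git",)):
    """Return (total_bytes, sensitive_paths) for the files under root."""
    total_bytes = 0
    sensitive = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune in place so os.walk does not descend into skipped directories.
        dirnames[:] = [d for d in dirnames if d not in skip_dirs]
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue  # do not follow or count symlinks
            total_bytes += os.path.getsize(path)
            # Match on the file name only, not on the full path.
            if any(fnmatch.fnmatch(name, pat) for pat in SENSITIVE_PATTERNS):
                sensitive.append(os.path.relpath(path, root))
    return total_bytes, sorted(sensitive)
```

Calling `scan_workspace(".")` from the workspace root returns the byte count and the flagged relative paths. A warning dialog could show those two values and ask for confirmation before the first sync.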

To Reproduce
Steps to reproduce the behavior:

  1. Install the extension
  2. Have a large workspace open in VS Code
  3. Connect to Databricks and attach to a cluster
  4. Try to run a Python script
  5. A long sync process starts

System information:

  1. VS Code version: September 2023 (version 1.83)
  2. Extension version: v1.1.5
