This python CLI generates a Databricks archive from a source application so that it can run in Databricks without needing to install a whl file on a cluster. This is useful for deploying applications to Databricks that are not part of the Databricks ecosystem, or for deploying applications that are not part of the Databricks ecosystem. It is also for users that do not have the ability to install packages on a Databricks cluster.
Install dependencies with poetry:
poetry install
Beyond dependencies and a sample application nothing further is needed.
poetry run pytest
Build the databricks archive file
py2databricks build --entrypoint <entrypoint_module> --output_name databricks_archive.zip
Print the dependency tree for the application
py2databricks print_dependency_tree --entrypoint <entrypoint_module>