forked from https://github.com/danielsc/azureml-and-dask
Dask + Azure ML = OSS Data Science & ML @ Scale.
AzureMLCluster
makes it easy to setup a powerful Dask cluster for interactive work.
There are a few options for starting a Dask cluster on Azure ML Compute.
See local for letting dask-cloudprovider
take care of creating (and deleting) the Azure ML Compute Target on your behalf. You can specify a vm_size
and other parameters.
This is recommended to avoid vnet and SSH setup.
If you need your Compute Target on a vnet, it is recommended to use an Azure ML Compute Instance deployed within the same vnet for creating a Dask cluster.
See ct-vnet for details:
- 01.setup-compute.ipynb can be run from anywhere and will create a vnet, Compute Instance on the vnet, and Compute Target on the vnet
- 02.start-cluster.ipynb must be run on the Compute Instance in the vnet and creates a Dask cluster from the Compute Target
If you'd otherwise like to manage the Compute Target, you need to enable SSH when creating it to allow for tunneling to connect to the Dask cluster.
See ct-ssh-no-vnet for details:
- 01.start-cluster.ipynb can be run from anywhere and will create a pair of SSH public and private keys, Compute Target using the SSH public key, and Dask cluster using the SSH private key