I was one of the 300 MLEMAND recipients chosen from the 3,000 participants in the Machine Learning Scholarship Program for Microsoft Azure. This repo contains a collection of project deliverables from my MLEMAND course work.
Project notebooks are stowed in secured storage to discourage improper access. For legitimate needs, please email to request access.
- Build and optimize an Azure ML model by training two models, one using Scikit-learn algorithm connected to
Azure HyperDrive
(a hyperparameter tuning engine), and the other, usingAzure AutoML
automated training process. The optimized or best performing model is chosen from the two approaches. The workflow is as shown below (Image credit: Udacity MLEMA Nanodegree):
The Bank Marketing dataset from UCI ML Repository was used to train the two models.
Project specification: refer to this rubric
Project code and artifact repository: click here
Tool: Azure ML Studio, Azure Python SDK, Jupyter Notebook
Artifact: udacity-project.ipynb, train.py, README.md
- Use this Bank Marketing dataset to configure an Azure cloud-based machine learning production model, deploy it, and consume it. Also create, publish and consume a pipeline from the training experiment run. Finally, create a README file and a screencast video to demonstrate the working ML model. The workflow is as shown below:
Project specification: refer to this rubric
Project code and artifact repository: click here
Tool: Azure ML Studio, Azure Python SDK, Jupyter Notebook, Apache Benchmarking tool, Swagger
Artifact: aml-pipelines-with-automated-machine-learning-step.ipynb, train.py, README.md, screencast of the working ML model
- Create two models: one using Automated ML (denoted as AutoML) and one customized model whose hyperparameters are tuned using HyperDrive. Compare the performance of both the models and deploy the best performing model as a web service. The workflow is as shown below:
Use a dataset external to the Azure ML ecosystem to train the models, e.g. from Kaggle, UCI ML repository, or any other open-source data repositories. The dataset format must be supported by Azure ML's
automl
API.Project specification: refer to this rubric
Project code and artifact repository: click here
Tool: Azure ML Studio, Azure Python SDK, Jupyter Notebook
Artifact: automl.ipynb, hyperparameter_tuning.ipynb, train.py, conda_env.yml, score.py, screencast of the working ML model