Machine Learning Starter Kit

The Machine Learning Starter Kit contains resources for developers and managers to gain practical ML knowledge and to learn how to apply it to their own projects.

There is also an accompanying email course with additional, exclusive and free content to help you get started progressively with ML — sign up if you haven’t done so already!

Contents of the Kit

  • Bootstrapping Machine Learning 1.0.5 sample.pdf (Introduction to Machine Learning taken from my book)
  • When Machine Learning fails.pdf
  • How to improve your business by predicting churn.pdf (Practical & code-free tutorial)
  • Machine Learning Canvas v0.4.pdf (blank template to formalize your own ML use case)
  • From Data to AI with the ML Canvas.pdf (a preliminary guide to using the Canvas)
  • code/
    • Machine Learning with BigML API - Interactive Code Tutorial.html (link to the notebook below, hosted on Wakari)
    • Machine Learning with BigML API - Interactive Code Tutorial.ipynb (Jupyter notebook in Python on the basics of BigML; see the setup instructions below or use the version hosted on Wakari)
    • AmazonML-Python.ipynb

Datasets

Below is a list of real-world datasets that you can use to experiment with ML. To keep the kit small, the datasets themselves are not included. You can download the ones you're interested in over HTTP (links provided), or point to them from BigML or Amazon ML using the S3 paths given in brackets.
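If you only have the S3 path of a dataset and want to fetch it over HTTP, the sketch below shows one way to do it in Python. It assumes the object is publicly readable and uses the standard virtual-hosted-style S3 URL form; the helper names (`s3_to_http_url`, `download_dataset`) and the bucket and file names are made up for illustration and are not part of this kit.

```python
from urllib.parse import urlparse, quote
from urllib.request import urlretrieve


def s3_to_http_url(s3_path: str) -> str:
    """Turn an s3://bucket/key path into a virtual-hosted-style HTTPS URL.

    Assumption: the object is public. Private objects need signed requests,
    which this sketch does not handle.
    """
    parsed = urlparse(s3_path)
    key = parsed.path.lstrip("/")
    if parsed.scheme != "s3" or not parsed.netloc or not key:
        raise ValueError(f"not a valid S3 path: {s3_path!r}")
    # Percent-encode the key so spaces and special characters survive in a URL.
    return f"https://{parsed.netloc}.s3.amazonaws.com/{quote(key)}"


def download_dataset(s3_path: str, destination: str) -> str:
    """Download a public S3 object to a local file and return the local path."""
    local_path, _headers = urlretrieve(s3_to_http_url(s3_path), destination)
    return local_path


# Hypothetical bucket and key, for illustration only.
print(s3_to_http_url("s3://example-bucket/data/churn.csv"))
```

Calling `download_dataset("s3://example-bucket/data/churn.csv", "churn.csv")` would then save the file locally, provided the bucket and object actually exist and are public.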

Creating accounts on cloud ML platforms

  • Create an account on BigML. Usage of the service is free in “dev” mode.
  • Create an Amazon Web Services account in order to use Amazon ML (AML). Unfortunately there is no free trial, so you will need to give your credit card details. That being said, usage is very cheap for basic experiments (see pricing: $0.42 per hour plus a few cents for predictions). You will also need to go through identity verification and make sure you can list “entities” on the AML dashboard.

Both services can be used through graphical, web-based interfaces. I recommend that you write down your access credentials somewhere safe (I use LastPass) and use an up-to-date version of the Chrome browser.

Other services

There are many other services you could use, and some may be a better fit for your needs. I recommend that you have a look at the PAPIs conferences (past events and videos) to learn more.

Running the notebooks

See code/README.md

Continue your journey in Machine Learning

Part of this Kit is based on my book, Bootstrapping Machine Learning, where you’ll find more details and resources to go further.

Chris Bourguignat (Data Science Lead at global insurance company Axa) said about the book:

“I really wish it existed when I first learned machine learning”

Don’t wait: get your copy now!

Copyright

Louis Dorard © 2017 — All Rights Reserved

Follow me on Twitter @louisdorard | Contact me here