/DataLearn-ML-Intro-2019

The repository of the hands-on introduction to machine learning workshop of the DataLearn 2019 track at DataHack 2019.

Primary LanguageJupyter Notebook

DataLearn Supervised ML Intro 2019

The repository of the hands-on introduction to machine learning workshop of the DataLearn 2019 track at DataHack 2019.

Video link: https://youtu.be/Su8YcXgkDsk?t=1701

Meetup event link

Notebooks by Shay Palachy. Presentation by Shay Palachy and Dana Kaner. (Thank you Dana <3)

Resources

Outline

  • Tools of the trade
    • Jupyter notebooks
    • numpy
    • pandas
  • Data exploration
  • Preprocessing
    • Imputation
    • Scaling and normalization
    • Handling outliers
    • Feature extraction/generation
    • Feature selection
    • Dimensionality reduction
  • Modeling
    • Model fit & loss functions
    • Splitting your data
    • Model evaluation
    • Hyperparameter Optimization