Applied Statistical Learning in Python

KKU Datathon 2018
Mon, 17 September 2018 (1-4PM)
Calvin J Chiew, Arporn Wangwiwatsin

Description

This workshop aims to introduce clinicians to popular statistical methods used in machine learning, without delving into the underlying mathematical theory. We will focus on the random forest and support vector machine for classification, as well as general concepts of model fit and cross-validation. In the hands-on exercise, you will be asked to implement and evaluate these models on a clinical prediction problem. No prior programming experience is assumed. Basics of the Python language and Jupyter notebook environment will be covered.

Requirements

Laptop
Anaconda 5.0.1 (Python 3.6 version) installed

Instructions

Download the installer for Anaconda (Python 3.6 version) on your respective OS (Windows/Mac/Linux) from https://www.anaconda.com/download/.
Run the installer and use all default options when prompted.
After installation is complete, make sure you can open Anaconda Navigator and launch Jupyter notebook.

Materials

No.	Section	Links
(1)	Lecture	Slides, Handout
(2)	Python	Dataset, Notebook, Solutions
(3)	Sample	Dataset, Notebook
(4)	Exercise	Dataset, Notebook, Solutions

Resources

An Introduction to Statistical Learning with Applications in R by G James, D Witten, T Hastie & R Tibshirani (Springer 2013)

calvinjchiew/kku18