Eng-Ahmd/GWU_ML-1

Class Materials for DNSC 6314 and 6315, Machine Learning I and II.

Jupyter NotebookMIT

GWU_DNSC 6314 & 6315: Course Outline

Materials for an introduction to machine learning.

Lecture 1: Preliminaries, Feature Engineering and Feature Selection
Lecture 2: Contemporary Linear Model Approaches
Lecture 3: Model Assessment and Selection
Lecture 4: Decision Trees
Lecture 5: Artificial Neural Networks
Lecture 6: Other Estimators: Support Vector Machines (SVM) k-Nearest-Neighbors (kNN), etc.
Lecture 7: Decision Tree Ensembles
Lecture 8: Convolutional Neural Networks
Lecture 9: Clustering
Lecture 10: Dimension Reduction
Lecture 11: Association Rules and Recommendation

Corrections or suggestions? Please file a GitHub issue.

Preliminary Resources

Lecture 1: Preliminaries, Feature Engineering and Feature Selection

_{^{Source: Lecture 1 feature extraction example.}}

Lecture 1 Class Materials

_{^{All notebooks also available in the notebook folder.}}

Lecture 1 Reading

Label, Segment, Featurize: a cross domain framework for prediction engineering
Introduction to Data Mining - Sections 2.2-2.3 (Chapter 2 notes)
Introduction to the Foundations of Causal Discovery - Sections 1-4, and 7

Lecture 1 Links

Lecture 2: Contemporary Linear Model Approaches

_{^{Source: From GLM to GBM: Building the Case For Complexity.}}

Lecture 2 Class Materials

_{^{Notebooks and data also available via GitHub.}}

Lecture 2 Reading

Elements of Statistical Learning:
- Sections 3.1 - 3.4
- Section 4.4
Regularization and variable selection via the elastic net

Lecture 2 Links

h2o (Python or R download, requires Java)
Generalized Linear Model (GLM) documentation
Generalized Linear Modeling with H2O
elasticnet (R)
glmnet (R)

Lecture 3: Model Assessment and Selection

_{^{Source: From Lecture 3.}}

Lecture 3 Class Materials

_{^{Notebooks and data also available via GitHub.}}

Lecture 3 Reading

Elements of Statistical Learning:
- Sections 7.1 - 7.5
- Section 7.10
Introduction to Data Mining:
- Sections 3.4 - 3.6
KDD-Cup 2004: Results and Analysis

Lecture 4: Decision Trees

_{^{Source: Machine Learning for High-Risk Applications.}}

Lecture 4 Class Materials

_{^{Notebooks and data also available via GitHub.}}

Lecture 4 Reading

Introduction to Data Mining:
- Sections 3.1 - 3.3
Elements of Statistical Learning:
- Section 9.2

Lecture 5: Artificial Neural Networks

_{^{Source: Demystifying Deep Learning, SAS Institute.}}

Lecture 5 Class Materials

_{^{Notebooks and data also available via GitHub.}}

Lecture 5 Reading

Introduction to Data Mining:
- Section 4.7
Elements of Statistical Learning:
- Sections 11.3 - 11.7

Lecture 5 Links

Neural Network Zoo
Deep Learning with H2O
Neural Additive Models: Interpretable Machine Learning with Neural Nets (How I would recommend training an ANN for structured data.)
THE MNIST DATABASE

Lecture 6: Support Vector Machines and k-Nearest-Neighbors

_{^{Source: From Assignment 6.}}

Lecture 6 Class Materials

_{^{Notebooks and data also available via GitHub.}}

Lecture 6 Reading

Introduction to Data Mining:
- Section 4.9
Elements of Statistical Learning:
- Sections 12.1 - 12.3

Lecture 7: Decision Tree Ensembles

_{^{Source: From Lecture 7.}}

Lecture 7 Class Materials

_{^{Notebooks and data also available via GitHub.}}

Lecture 7 Reading

Introduction to Data Mining:
- Section 4.10
Elements of Statistical Learning:
- Chapter 10
- Chapter 15

Lecture 7 Links

Lecture 8: Convolutional Neural Networks

_{^{Source: From Lecture 8, with thanks to Wen Phan.}}

Lecture 8 Class Materials

_{^{Notebooks are also available via GitHub.}}

Lecture 8 Reading

Introduction to Data Mining:
- Section 4.8
Deep Learning:
- Chapter 9

Lecture 8 Links

Keras

Lecture 9: Clustering

_{^{Source: From Assignment 9 Notebook.}}

Lecture 9 Class Materials

_{^{Notebooks and data are also available via GitHub.}}

Lecture 9 Reading

Introduction to Data Mining:
- Chapter 7, through Section 7.3
Elements of Statistical Learning:
- Section 14.3

Lecture 10: Dimension Reduction

_{^{Source: From Lecture 10 Code Example.}}

Lecture 10 Class Materials

_{^{Notebooks and data are also available via GitHub.}}

Lecture 10 Reading

Lecture 11: Association Rules and Recommendation

Lecture 11 Class Materials

_{^{Notebooks and data are also available via GitHub.}}

Lecture 11 Reading

Introduction to Data Mining, Chapter 5
- Sections 5.1 – 5.3, and 5.7
Elements of Statistical Learning:
- Section 14.5 - 14.6
Introduction to Recommender Systems Handbook