/pattern_classification

A collection of tutorials and examples for solving and understanding machine learning and pattern classification tasks

GNU General Public License v3.0GPL-3.0

logo


**A collection of tutorials and examples for solving and understanding machine learning and pattern classification tasks.**



Machine Learning and Pattern Classification






Machine learning and pattern classification with scikit-learn

  • Entry Point: Data - Using Python's sci-packages to prepare data for Machine Learning tasks and other data analyses [IPython nb]

  • An Introduction to simple linear supervised classification using scikit-learn [IPython nb]



Techniques for Dimensionality Reduction

  • Projection

    • Component Analyses
  • Feature Selection

    • Sequential Feature Selection Algorithms [IPython nb]



Techniques for Parameter Estimation

  • Parametric Techniques

    • Introduction to the Maximum Likelihood Estimate (MLE) [IPython nb]
    • How to calculate Maximum Likelihood Estimates (MLE) for different distributions [IPython nb]
  • Non-Parametric Techniques

    • Kernel density estimation via the Parzen-window technique [IPython nb]
    • The K-Nearest Neighbor (KNN) technique
  • Regression Analysis

    • Linear Regression

    • Non-Linear Regression




Statistical Pattern Recognition Examples

  • Supervised Learning

    • Parametric Techniques

      • Univariate Normal Density

        • Ex1: 2-classes, equal variances, equal priors [IPython nb]
        • Ex2: 2-classes, different variances, equal priors [IPython nb]
        • Ex3: 2-classes, equal variances, different priors [IPython nb]
        • Ex4: 2-classes, different variances, different priors, loss function [IPython nb]
        • Ex5: 2-classes, different variances, equal priors, loss function, cauchy distr. [IPython nb]
      • Multivariate Normal Density

        • Ex5: 2-classes, different variances, equal priors, loss function [IPython nb] - Ex7: 2-classes, equal variances, equal priors [IPython nb]
    • Non-Parametric Techniques

  • Unsupervised Learning




Links to useful resources



Dataset Collections

  • Kaggle - Kaggle, the leading platform for predictive modeling competitions.

  • UCI MLR - UC Irvine Machine Learning Repository

  • google.com/publicdata - public data maintained by Google

  • Freebase - A community-curated database of well-known people, places, and things



Specialized Datasets