/Data-Learning

This repository provides the code for the subject Data Learning, which forms part of the course Big Data and Social Media taught in HU Utrecht, translated into Python.

Primary LanguageJupyter NotebookMIT LicenseMIT

Data Learning

This repository provides the code for the subject Data Learning, taught by Aletta Smits at HU Utrecht, translated into Python.

Course Structure

1. Clustering

  • Kmeans.
  • Dummy Variables.
  • Normalization.

2. Classification

  • Knn classification of Iris dataset.

3. Recommender Systems

  • Who is my buddy?
  • Movie recommender system.

4. Decision Tree and Random Forests

  • Decision Tree Classifier.
  • Random Forest Classifier.

5. Regex

  • Clean hastags.
  • Clean Mentions.

6. Forecasting