/Fall2017-project4-grp5

fall2017-project4-fall2017-project4-group5 created by GitHub Classroom

Primary LanguageR

Project 4: Collaborative Filtering

screenshot

Term: Fall 2017

  • Team 5

  • Project title: Collaborative Filtering

  • Team members

  • Project summary: In this project, we implemented two algorithms: memory-based algorithm, including Similarity Weight, Selecting Neighbours and Rating Normalization and model-based algorithm, including Cluster Models to conduct Collaborative Filtering. And we used two datasets, Anonymous Microsoft Web Data and EachMovie Dataset to evaluate and compare a pair of algorithms for collaborative filtering (CF). For evaluation part, compare the performance for these different algorithms and component combinations using ranked scoring for dataset 1 and mean absolute error (MAE) and ROC sensitivity for dataset 2.

  • Project details: In the memory-based algorithm: First, we computed different similarity weights. Then, we selected best-n neighbors based on their weights. Last, we predicted ratings using z-score. In the model-based algorithm, we used cluster model to do Collaborative Filtering.

  • Results: We found memory-based model with pearson weight + significance weighting + best-n + z-score produced the highest R-score.

screenshot

Contribution statement: (default) All team members contributed equally in all stages of this project. All team members approve our work presented in this GitHub repository including this contributions statement. And the most important thing is all of our members worked very hard to complete our project.

Following suggestions by RICH FITZJOHN (@richfitz). This folder is orgarnized as follows.

proj/
├── lib/
├── data/
├── doc/
├── figs/
└── output/

Please see each subfolder for a README file.