
Sentimental Analysis on Movie Comments for ISRP at UCSD

Primary Language: Python

Sentimental Analysis on Movie Comments

A project for the International Summer Research Program (ISRP) at the University of California, San Diego

  • A contest on Kaggle
  • Uses linear regression
  • Applies natural language processing methods such as stemming and bigrams
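
As a small illustration of the bigram idea mentioned above, the sketch below pairs each token with its successor, so that a phrase like "not good" can be counted as a single feature. The function name `bigrams` is hypothetical and not taken from the project code.

```python
def bigrams(tokens):
    """Return the list of adjacent token pairs in a sentence."""
    # zip stops at the shorter sequence, so a single token yields no pairs.
    return list(zip(tokens, tokens[1:]))


tokens = "this movie is not good".split()
print(bigrams(tokens))
```

Bigrams keep some word order that single-word counts lose, which matters for negations.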

Methods

  1. Preprocess - Lowercase, remove punctuation, stem, and filter out stopwords.
  2. Feature - (1) Count the frequency of each word. (2) Keep the n most frequent words. (3) Represent each sentence as a feature vector over those n words.
  3. Machine Learning - Linear regression. The model is a parameter vector theta of size n.
  4. Predict.
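
The four steps above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the stopword list, the suffix-stripping `crude_stem` (a stand-in for a real stemmer such as Porter), the toy training sentences with 0/1 labels, and the gradient-descent settings are all hypothetical.

```python
import string
from collections import Counter

# Tiny illustrative stopword list (assumption; the project's list is not given).
STOPWORDS = {"a", "an", "the", "is", "was", "this", "it", "and", "of", "to"}


def crude_stem(word):
    # Stand-in for a real stemmer: strip a few common suffixes.
    for suffix in ("ing", "ed", "ly", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def preprocess(sentence):
    # Step 1: lowercase, remove punctuation, filter out stopwords, stem.
    cleaned = sentence.lower().translate(str.maketrans("", "", string.punctuation))
    return [crude_stem(w) for w in cleaned.split() if w not in STOPWORDS]


def build_vocab(token_lists, n):
    # Step 2: count every word and keep the n most frequent ones.
    counts = Counter(tok for toks in token_lists for tok in toks)
    return [word for word, _ in counts.most_common(n)]


def vectorize(tokens, vocab):
    # One count per vocabulary word; words outside the vocabulary are ignored.
    counts = Counter(tokens)
    return [float(counts[w]) for w in vocab]


def fit(X, y, lr=0.3, epochs=3000):
    # Step 3: linear regression by batch gradient descent on squared error.
    # theta has one weight per feature (size n), with no bias term.
    m, n = len(X), len(X[0])
    theta = [0.0] * n
    for _ in range(epochs):
        grad = [0.0] * n
        for xi, yi in zip(X, y):
            err = sum(t * v for t, v in zip(theta, xi)) - yi
            for j in range(n):
                grad[j] += err * xi[j]
        theta = [t - lr * g / m for t, g in zip(theta, grad)]
    return theta


def predict(sentence, vocab, theta):
    # Step 4: score a new sentence with the learned theta.
    x = vectorize(preprocess(sentence), vocab)
    return sum(t * v for t, v in zip(theta, x))


# Toy data with hypothetical labels: 1.0 = positive, 0.0 = negative.
train = [
    ("A great and moving film", 1.0),
    ("Great acting, loved it", 1.0),
    ("A boring, terrible movie", 0.0),
    ("Terrible plot and boring acting", 0.0),
]
token_lists = [preprocess(s) for s, _ in train]
vocab = build_vocab(token_lists, n=6)
X = [vectorize(toks, vocab) for toks in token_lists]
y = [label for _, label in train]
theta = fit(X, y)

print(predict("Great film", vocab, theta))
print(predict("Boring, terrible plot", vocab, theta))
```

In practice a library stemmer and a closed-form or library regression solver would likely replace the hand-written pieces; they are spelled out here only to keep the sketch self-contained.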