This project implements a linear classifier for sentimental analysis purposes on product reviews.
The linear classifier takes advantage of:
Use the use_bigram
in main.py
to set the language model as bigram (True
) or unigram (False
).
Set the remove_stopwords
variable to choose to remove the predefined stopwords (e.g. i, me, my, myself, etc.) from the extracted dictionary and the generated feature matrices.
Run the test.py
python script for basic validation tests.
Run the main.py
python script to see the results.