Moviereview

The dataset is available on Kaggle. Dataset name: labelledtrain

# Preprocessing stages

Preprocessing includes expanding contractions, stemming, lemmatization, etc., to prepare the raw words for conversion to vectors.

# Language modelling

A language model is required to represent the text in a form the machine can understand. It includes bigrams and trigrams.
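
The two stages described here can be sketched roughly as follows. This is a minimal illustration and not necessarily the code used in this project: `CONTRACTIONS`, `tokenize`, `toy_stem` and `ngrams` are hypothetical names, the contraction table is deliberately tiny, and the suffix-stripping function is only a stand-in for a real stemmer or lemmatizer (for example from NLTK). The sample review sentence is made up and is not taken from the dataset.

```python
import re

# Hypothetical, deliberately small contraction table (assumption, not from the project).
CONTRACTIONS = {
    "can't": "cannot",
    "won't": "will not",
    "didn't": "did not",
    "it's": "it is",
}

# Toy suffix list standing in for a real stemmer such as a Porter stemmer.
SUFFIXES = ("ing", "ed", "s")


def tokenize(text):
    """Lowercase the text, split it into words and expand known contractions."""
    tokens = []
    for raw in re.findall(r"[a-z']+", text.lower()):
        expanded = CONTRACTIONS.get(raw, raw)
        tokens.extend(expanded.split())
    return tokens


def toy_stem(word):
    """Strip one common suffix, keeping at least three characters of the word."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def ngrams(tokens, n):
    """Return all runs of n consecutive tokens as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Made-up example sentence, used only for illustration.
review = "I didn't like the ending, but the actors played well."
tokens = [toy_stem(t) for t in tokenize(review)]
bigrams = ngrams(tokens, 2)
trigrams = ngrams(tokens, 3)

print(tokens)
print(bigrams[:3])
print(trigrams[:3])
```

Counting how often each bigram or trigram occurs across all reviews is one common way to turn the cleaned text into numeric features for a model.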