Mtech Summer Sem NLP Project, TOPIC : Perform various techniques (Data cleaning, data reduction, pre-processing, feature selection, feature engineering, anomaly deteciton, data visualization, data type convertion, handling missing data, filter wanted outliers) on any text data.
PREPROCESSING DONE -> lowercasing, Stemming, Removing Stopwords, Lemmatization, Part-of-Speech Tagging, finally tokenization.
CLASSIFICATION TECHINQUES USED -> Term Frequency-Inverse Document Frequency (TF-IDF), Support Vector Machine (SVM)
DATA VIZUALIZATION TECHNIQUES USED -> Topic Modeling Visualization, Word Cloud, Box Plots.