/NLP-MTECH

Primary LanguagePython

NLP-MTECH

Mtech Summer Sem NLP Project, TOPIC : Perform various techniques (Data cleaning, data reduction, pre-processing, feature selection, feature engineering, anomaly deteciton, data visualization, data type convertion, handling missing data, filter wanted outliers) on any text data.

PREPROCESSING DONE -> lowercasing, Stemming, Removing Stopwords, Lemmatization, Part-of-Speech Tagging, finally tokenization.

CLASSIFICATION TECHINQUES USED -> Term Frequency-Inverse Document Frequency (TF-IDF), Support Vector Machine (SVM)

DATA VIZUALIZATION TECHNIQUES USED -> Topic Modeling Visualization, Word Cloud, Box Plots.