This repository consists of basic webmining and text processing algorithms. Using nltk toolkit for text processing and pandas for storage in a csv file. This was used for learning the basics of web mining so as get a firm foothold in my class.
Uses Stemming, Stopword removal for the preprocessing.
- NLTK
- Pandas
- NumPy