web_mining_basic

This repository consists of basic webmining and text processing algorithms. Using nltk toolkit for text processing and pandas for storage in a csv file. This was used for learning the basics of web mining so as get a firm foothold in my class.

Functionality

Uses Stemming, Stopword removal for the preprocessing.

Libraries

  • NLTK
  • Pandas
  • NumPy