Information Retrieval Basic Techniques

This Sample project demonstrated the basic IR techniques by using python. (tokenization, isolated word correction, context sensitive word correction, Stemming, and Lemmatization)

Data Files Used

  1. Twitter Feed
  2. Student Course Feedback Data
  3. Research Paper

Libraries & modules are required as prerequisites

  1. Installing pyspellchecker as the isolated word corrector pip install pyspellchecker

  2. Installing the symspellpy module pip install -U symspellpy

  3. Download wordnet lemmarizer by using the python console.

        >> import nltk
        >> nltk.download()