Repository for basic applied NLP tasks for an introduction to:
- NLTK notes: Basic NLTK reference, containing excerpts from the NLTK book
- NLTK - Comparing pronoun and modal verb language use: Basic codes for counting pronoun and modal verb usage in text, and how to use NLTK to do this faster.
- Zipf's Law: Zipf's law states that given a large sample of words used, the frequency of any word is inversely proportional to its rank in the frequency table. So word number n has a frequency proportional to 1/n. This file explores this empirical law practically.
- Analogy Prediction: Using pre-trained word vectors for predicting analogies between word pairs based on relations.
- Language Model for machine translation between English, French, Italian and Spanish: Building a crude character level bigram language model to perform translation between languages such as English, French, Italian and Spanish.