/ShallowNLP

Shallow techniques for natural language processing

Primary LanguagePython

README

This is the homework repository for LING570: Shallow techniques for NLP

HW1

A simple word tokenizer using Regex

HW2

DFA and NFA acceptors

HW3

FSA acceptor using Viterbi algorithm

HW4

An expanded FSM given a lexicon and morphotactic rules expressed by an FSA

HW5

N-gram language models

HW6

A 2-gram HMM and a 3-gram HMM

HW7

Viterbi algorithm for HMM

HW8

A feature vector builder which reads raw text data and generates feature vectors.

HW9

A system which reads and processes text data, generates features used in MaxEnt.