See decode.py; for the stretch goal, see evaluate.py.

Sample evaluation output:

Word accuracy:		98.57%
Sentence accuracy:	78.00%
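
For readers unfamiliar with the two metrics, here is a minimal sketch of how they could be computed. It is not taken from evaluate.py: the function name `accuracy`, the whitespace tokenization, and the position-by-position word comparison are all assumptions made for illustration, and the toy sentences are invented.

```python
def accuracy(references, hypotheses):
    """Return (word_accuracy, sentence_accuracy) as percentages.

    Hypothetical helper, not the code in evaluate.py. Assumes each
    hypothesis is meant to align word-for-word with its reference.
    """
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")

    words_total = 0
    words_correct = 0
    sentences_correct = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words = ref.split()
        hyp_words = hyp.split()
        words_total += len(ref_words)
        # zip stops at the shorter list, so missing words count as wrong.
        words_correct += sum(r == h for r, h in zip(ref_words, hyp_words))
        # A sentence counts only if every word matches exactly.
        sentences_correct += ref_words == hyp_words

    word_acc = 100.0 * words_correct / words_total if words_total else 0.0
    sent_acc = 100.0 * sentences_correct / len(references) if references else 0.0
    return word_acc, sent_acc


# Toy data (invented for this example).
refs = ["the cat sat", "a dog ran"]
hyps = ["the cat sat", "a dog run"]
word_acc, sent_acc = accuracy(refs, hyps)
print(f"Word accuracy:\t\t{word_acc:.2f}%")      # 5 of 6 words -> 83.33%
print(f"Sentence accuracy:\t{sent_acc:.2f}%")    # 1 of 2 sentences -> 50.00%
```

Under these assumptions a single wrong word makes its whole sentence count as wrong, which would explain why sentence accuracy sits well below word accuracy in the sample output.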