austin226/ling573

Implement document reader

Closed this issue · 2 comments

Unless @corbettmoore was going to do this as part of content selection, I'd like to implement the DocReader class that converts the initial XML file into a set of document texts.

If you want to do that part, go for it. If there's something special about the format, like the timestamp, let me know.