tannineo/cs7is3-team4

Parsing the corpora

Opened this issue · 5 comments

Find the resource for the test data.

I renamed them in batch under OSX.

The SGML files are using *.sgm.
DTD files are all using *.dtd.
Text readmes are using *.txt.

Don't use the parsed json...