Process files from the Old Bailey on-line project. Convert the files (obtained from the University of Giessen into "raw" NAF format.
- Download
OldBaileyCorpus2.zip
and unpack it somewhere. It contains a sub-directory
OBC2
with \XML{} encoded reports of the sessions held in the Old Bailey. - Set the following environment parameters:
corpusdir : Path to the
OBC2
directory nafdir : Path to the directory whre the NAF files ought to go. - Make sure that
nafdir
exists. - run
python bailey_to_naf.py