This repository contains the CRAFT corpus, a collection of 97 articles from the PubMed Central Open Access subset, each of which has been annotated along a number of different axes spanning structural, coreference, and concept annotation.
To cite the CRAFT corpus, please see the CRAFT Reference wiki page.
For installation and other usage instructions, please see the CRAFT Wiki.
For stable releases, please download from the CRAFT Releases page.
The distribution has been streamlined to include only a single file format for each annotation type. In place of multiple file formats for each annotation type, the CRAFT corpus is distributed with a script which can convert annotations from the native file format into a variety of other file formats. Please see the Creating alternative annotation file formats wiki page for details.
Please direct comments, questions, and suggestions to the Issues section of the CRAFT GitHub page, or send e-mail to Mike Bada at mike.bada@ucdenver.edu.