/CRAFT

Primary LanguageClojureOtherNOASSERTION

The Colorado Richly Annotated Full-Text (CRAFT) Corpus

This repository contains the CRAFT corpus, a collection of 97 articles from the PubMed Central Open Access subset, each of which has been annotated along a number of different axes spanning structural, coreference, and concept annotation.

Citing CRAFT

To cite the CRAFT corpus, please see the CRAFT Reference wiki page.

Using CRAFT

For installation and other usage instructions, please see the CRAFT Wiki.

Stable releases

For stable releases, please download from the CRAFT Releases page.

Creating alternative file formats

The distribution has been streamlined to include only a single file format for each annotation type. In place of multiple file formats for each annotation type, the CRAFT corpus is distributed with a script which can convert annotations from the native file format into a variety of other file formats. Please see the Creating alternative annotation file formats wiki page for details.

Feedback

Please direct comments, questions, and suggestions to the Issues section of the CRAFT GitHub page, or send e-mail to Mike Bada at mike.bada@ucdenver.edu.