
[PlanTL/medicine/document] Spanish Clinical Case Corpus


SPACCC: Spanish Clinical Case Corpus

This repository contains the Spanish Clinical Case Corpus.

The SPACCC corpus was built from 1,000 clinical cases collected from SciELO (Scientific Electronic Library Online), an electronic library that hosts full-text articles from scientific journals of Latin America, South Africa and Spain (http://www.scielo.org).

A clinician classified the cases into two groups: those similar to real clinical texts in structure and content, and those unsuitable for this purpose. Figure legends were removed automatically, and documents that listed multiple clinical cases were split into single clinical cases.
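
The automatic steps described above (removing figure legends and splitting multi-case documents) could look roughly like the following Python sketch. This is only an illustration under stated assumptions: the actual rules used to build SPACCC are not documented here, and the patterns `LEGEND_RE` and `CASE_HEADER_RE`, as well as both function names, are hypothetical.

```python
import re
from typing import List

# Hypothetical patterns; the real SPACCC preprocessing rules are not documented here.
# Assumes a figure legend is a line starting with "Figura N" or "Fig. N".
LEGEND_RE = re.compile(r"^\s*(Figura|Fig\.)\s+\d+\b.*$", re.IGNORECASE)
# Assumes each case in a multi-case document starts with a "Caso N" / "Caso clínico N" line.
CASE_HEADER_RE = re.compile(r"^\s*Caso\s+(cl[ií]nico\s+)?\d+\b.*$", re.IGNORECASE)


def remove_figure_legends(text: str) -> str:
    """Drop every line that looks like a figure legend."""
    return "\n".join(
        line for line in text.splitlines() if not LEGEND_RE.match(line)
    )


def split_cases(text: str) -> List[str]:
    """Split a document into single cases at each case-header line."""
    cases: List[str] = []
    current: List[str] = []
    for line in text.splitlines():
        if CASE_HEADER_RE.match(line):
            # A new header closes the case collected so far, if it has content.
            if any(part.strip() for part in current):
                cases.append("\n".join(current).strip())
            current = []
        else:
            current.append(line)
    if any(part.strip() for part in current):
        cases.append("\n".join(current).strip())
    return cases


raw = (
    "Caso 1\n"
    "Varón de 45 años con dolor abdominal.\n"
    "Figura 1. TC abdominal.\n"
    "Caso 2\n"
    "Mujer de 60 años con fiebre."
)
cases = split_cases(remove_figure_legends(raw))
```

A document without any case header is returned as a single case, so the same function can be applied to every file regardless of how many cases it contains.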

Digital Object Identifier (DOI) and access to dataset files

https://doi.org/10.5281/zenodo.2560316

Contact

Ander Intxaurrondo (ander.intxaurrondo@bsc.es)

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright (c) 2018 Secretaría de Estado para el Avance Digital (SEAD)