This project presents NEREL-BIO -- an annotation scheme and corpus of PubMed abstracts in Russian and in English. NEREL-BIO extends the general domain dataset NEREL. NEREL-BIO annotation scheme covers both general and biomedical domains making it suitable for domain transfer experiments.
No. | Entity type | No. | Entity type | No. | Entity type |
---|---|---|---|---|---|
1. | ACTIVITY | 14. | MEDPROC | 27. | MONEY |
2. | ADMINISTRATION_ROUTE | 15. | MENTALPROC | 28. | NATIONALITY |
3. | ANATOMY | 16. | PHYS | 29. | NUMBER |
4. | CHEM | 17. | SCIPROC | 30. | ORDINAL |
5. | DEVICE | 18. | AGE | 31. | ORGANIZATION |
6. | DISO | 19. | CITY | 32. | PERCENT |
7. | FINDING | 20. | COUNTRY | 33. | PERSON |
8. | FOOD | 21. | DATE | 34. | PRODUCT |
9. | GENE | 22. | DISTRICT | 35. | PROFESSION |
10. | INJURY_POISONING | 23. | EVENT | 36. | STATE_OR_PROVINCE |
11. | HEALTH_CARE_ACTIVITY | 24. | FAMILY | 37. | TIME |
12. | LABPROC | 25. | FACILITY | ||
13. | LIVB | 26. | LOCATION |
https://arxiv.org/abs/2210.11913
@article{loukachevitch2022nerel,
title={NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities},
author={Loukachevitch, Natalia and Manandhar, Suresh and Baral, Elina and Rozhkov, Igor and Braslavski, Pavel and Ivanov, Vladimir and Batura, Tatiana and Tutubalina, Elena},
journal={arXiv preprint arXiv:2210.11913},
year={2022}
}