CONLL 2003 |
News |
DUA |
Sang and Meulder, 2003 |
Easy to find |
NIST-IEER |
News |
None |
NIST 1999 IE-ER |
NLTK data |
MUC-6 |
News |
LDC |
Grishman and Sundheim, 1996 |
LDC 2003T13 |
OntoNotes 5 |
Various |
LDC |
Weischedel et al., 2013 |
LDC 2013T19 |
BBN |
Various |
LDC |
Weischedel and Brunstein, 2005 |
LDC 2005T33 |
GMB-1.0.0 |
Various |
None |
Bos et al., 2017 |
http://gmb.let.rug.nl/data.php |
GUM-3.1.0 |
Wiki |
Several (*2) |
Zeldes, 2016 |
✔ Included here |
wikigold |
Wikipedia |
CC-BY 4.0 |
Balasuriya et al., 2009 |
✔ Included here |
Ritter |
Twitter |
None |
Ritter et al., 2011 |
No split , Train/test/dev split |
BTC |
Twitter |
CC-BY 4.0 |
Derczynski et al., 2016 |
✔ Included here |
WNUT17 |
Social media |
CC-BY 4.0 |
Derczynski et al., 2017 |
✔ Included here |
i2b2-2006 |
Medical |
DUA |
Uzuner et al., 2007 |
http://www.i2b2.org |
i2b2-2014 |
Medical |
DUA |
Stubbs et al., 2015 |
http://www.i2b2.org |
CADEC |
Medical |
CSIRO |
Karimi et al., 2015 |
http://data.csiro.au/ |
AnEM |
Anatomical |
CC-BY-SA 3.0 |
Ohta et al., 2012 |
✔ Included here |
MITRestaurant |
Queries |
None |
Liu et al., 2013a |
http://groups.csail.mit.edu/sls/ |
MITMovie |
Queries |
None |
Liu et al., 2013b |
http://groups.csail.mit.edu/sls/ |
MalwareTextDB |
Malware |
None |
Lim et al., 2017 |
http://www.statnlp.org/ |
re3d |
Defense |
Several (*1) |
DSTL, 2017 |
✔ Included here |
SEC-filings |
Finance |
CC-BY 3.0 |
Alvarado et al., 2015 |
✔ Included here |
Assembly |
Robotics |
X |
Costa et al., 2017 |
X |
WikiNEuRal |
Wikipedia |
CC BY-SA-NC 4.0 |
Tedeschi et al., 2021 |
https://github.com/Babelscape/wikineural |