izuna385/Wikia-and-Wikipedia-EL-Dataset-Creator
You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wiki are available!
PythonNOASSERTION
Issues
- 2
wikiextractor bug
#32 opened by ujiuji1259 - 0
Avoid bad anchor insert
#31 opened by izuna385 - 0
Strict SBD for ja-text and en-text.
#29 opened by izuna385 - 0
[WIP] en-wiki dump
#20 opened by izuna385 - 1