tatuylonen/wikitextprocessor
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
PythonNOASSERTION
Issues
- 1
- 8
Newlines inside `includeonly` are expanded on our side, but not in wikitext.
#314 opened by kristian-clausal - 2
- 6
- 52
- 9
- 2
`{{43e}}` not expanded
#298 opened by LeMoussel - 2
- 2
assert error at src/parse.py ln 2287
#282 opened by kylefoley76 - 9
Can't parse link nodes contain newline character
#266 opened by xxyzz - 59
Checklist-1 for existing errors.
#226 opened by LeMoussel - 1
non-interpretation of certain {{...}} & [[...]]
#261 opened by LeMoussel - 15
- 21
Template {{Voir homonymes|....}} is misinterpreted
#225 opened by LeMoussel - 17
Presence of spurious text.
#243 opened by LeMoussel - 5
ERROR: unimplemented parserfn PAGESIZE
#223 opened by LeMoussel - 3
- 7
ERROR: unimplemented parserfn filepath
#224 opened by LeMoussel - 6
ERROR: unimplemented parserfn #property
#220 opened by LeMoussel - 3
LUA error in #invoke('Bandeau', 'bandeau')
#216 opened by LeMoussel - 1
Infinite loop during `clean_node()`
#233 opened by LeMoussel - 3
EVOL: Store ID Page in SQLite database file.
#218 opened by LeMoussel - 9
ERROR: unimplemented parserfn #coordinates
#209 opened by LeMoussel - 2
WARNing "unrecognized time syntax in #time ..."
#219 opened by LeMoussel - 0
WARNING: unrecognized time syntax
#211 opened by LeMoussel - 3
- 34
Template class="error" when expanding.
#194 opened by LeMoussel - 2
ERROR: LUA error in #invoke('Biblio', 'lienWeb')
#210 opened by LeMoussel - 2
- 1
mediawiki_languagecodes.get_all_names should return all possible language codes as keys, even if they would have an empty string value?
#201 opened by kristian-clausal - 1
- 2
Template {{refnec|....}} is misinterpreted?
#202 opened by LeMoussel - 11
Template `{{date-|....}}` is misinterpreted?
#199 opened by LeMoussel - 50
How to get text from from templates?
#90 opened by rusg77 - 7
- 8
- 8
Expand template contains itself
#193 opened by xxyzz - 1
`<nowiki />` tag breaks parsing nodes
#180 opened by xxyzz - 2
Module 'ne-conj' not found Lua error
#172 opened by xxyzz - 3
`Can not match` Lua errors in the "ja-usex" Module
#170 opened by xxyzz - 3
- 4
- 3
New version failed on SQLite
#139 opened by ziorufus - 8
<ref> elements (and probably other html-like tags) inside list items can seeminly contain newlines
#86 opened by kristian-clausal - 2
- 1
- 0
- 4
Filenames cause errors on exFAT
#63 opened by brendanedwardgavin - 2
Cannot import 'Page'
#61 opened by thelahunginjeet - 1