LanguageMachines/PICCL
A set of workflows for corpus building through OCR, post-correction and normalisation
PythonNOASSERTION
Issues
- 2
- 0
how to create a corpus for a new language
#64 opened by ehrenmann1977 - 1
linkstrings stage fails
#63 opened by proycon - 2
Add text markup information after FoLiA-correct
#62 opened by proycon - 0
- 4
- 0
Pipeline is slower for files which are combined (input files with same prefix)
#36 opened by peterdekker - 1
running piccl to correct words in a simple wordlist
#58 opened by Irishx - 1
more elegant error handling needed
#59 opened by Irishx - 12
PICCL pipelines need to do better input validation and provide better error/warning messages to the user + general lack of documentation needs to improve
#37 opened by willstout - 7
FoLiA alignments in OCR output
#44 opened by proycon - 2
- 9
- 2
- 1
[webservice] Error on adding lexicon from input sources and no inputtemplate for adding custom lexicon
#56 opened by proycon - 4
TICCL-chain --alph option
#46 opened by proycon - 3
- 0
- 2
Files with spaces in filename not handled
#24 opened by bloemj - 13
- 0
Missing output files expected by foliacorrect
#52 opened by peterdekker - 6
- 15
Plain text processing does not work as expected?
#23 opened by proycon - 2
- 18
Autosearch forwarder gives server error
#51 opened by peterdekker - 4
TICCL fails when starting with Folia as input [make TICCL work with non-OCR text classes]
#48 opened by peterdekker - 2
- 0
- 2
- 0
Small mistake in example command
#41 opened by marijnschraagen - 17
"Process `ticclunk (1)` terminated with an error exit status (134)" from ticcl.nf
#39 opened by willstout - 4
- 3
No zip file generated in webinterface
#38 opened by peterdekker - 4
- 10
Fix travis tests
#16 opened by proycon - 10
frog.nf cannot find frog xml output
#29 opened by peterdekker - 2
ocr.nf cannot find FoLiA-hocr output files
#30 opened by peterdekker - 10
TICCL fails on empty unknown words file
#34 opened by peterdekker - 2
- 3
- 3
Frog does not honour --skip option
#28 opened by zeusttu - 7
- 0
Refactor Frog pipeline for performance
#27 opened by proycon - 0
Push PICCL/LaMachine image to Docker Hub
#26 opened by proycon - 2
- 2
- 17
- 2
Sth. wrong with default DEU-frak lexicon
#18 opened by martinreynaert - 2
Web version: PDF upload options
#17 opened by martinreynaert - 1
Downloading books/docs from a URL
#19 opened by martinreynaert