Issues
- 2
Output file encoding should be set to UTF-8
#19 opened by rxzhangGH - 1
- 2
Long sentences are not being removed apparently
#17 opened by cgr71ii - 5
Tests don't pass
#13 opened by cgr71ii - 9
'charmap' codec can't decode input_test_2.txt
#15 opened by b3ade - 3
Detokenization introduce new error to the data and ignore_detokenization is not working
#16 opened by jgcb00 - 4
Bifixer Indexerror: list index out of range
#7 opened by jokinlasa - 2
Bifixer doesn't work with new ftfy >=6.0
#5 opened by lpla - 1
	 introduces tabs in tsv output
#4 opened by jelmervdl - 2
Obtain a clean output
#3 opened by jgcb00 - 2
Bifixer doesn't see input file
#2 opened by Syrkovski