Issues
A dot added in dates
#52 opened by starkadur - 0
pkg_resources is deprecated
#46 opened by sveinbjornt - 1
Spaces deleted
#21 opened by starkadur - 7
split_into_sentences changes sentences
#24 opened by bnika - 0
Character omitted
#32 opened by starkadur - 3
Add a month
#37 opened by sigurdurb - 0
Support colon-separated duration?
#31 opened by sveinbjornt - 1
Bigger ordinal numbers in the tokenizer
#28 opened by helga-lvl - 3
The tokenizer is missing some abbreviations
#25 opened by helga-lvl - 1
Twitter handles and @usernames can contain periods (@matur.a.mbl) but are broken into sentences
#18 opened by sveinbjornt - 0
Not enough test coverage
#23 opened by peturorri - 1
Hyphen separated from word
#15 opened by starkadur - 0
Support for citation characters
#16 opened by sveinbjornt - 1
Detokenization adds spaces to "o.s.frv."
#13 opened by HaukurPall - 1
Tokenize() options
#4 opened by pallih - 1
KeyError for unknown abbreviations
#1 opened by sverrirab