ferristseng/rust-punkt
Implementation of the Punkt sentence tokenizing algorithm in Rust.
Rust, Apache-2.0
Issues
Transferring crate?
#17 opened by workingjubilee - 0
Panic with multibyte string
#13 opened by kfreiman - 0
How to add new training data?
#12 opened by zbrox - 3
Relicense under dual MIT/Apache-2.0
#7 opened by emberian - 0
`Default` is an unfortunate choice of name
#6 opened by shepmaster - 2
Is it possible to get byte or character offsets of tokenized words / sentences?
#4 opened by shepmaster - 0
Alter the way objects can be configured
#2 opened by ferristseng - 0
Second sentence seems to be missing
#5 opened by shepmaster - 0
Use string interning
#3 opened by ferristseng - 0