grantjenks/python-wordsegment
English word segmentation, written in pure Python and based on a trillion-word corpus.
Python
Issues
- 3
Using with Additional corpus of spelling mistakes.
#39 opened by willwade - 2
Support for Other Languages
#32 opened by ykhatami - 1
- 6
RecursionError on segment call
#33 opened by irmo322 - 3
'helloworld' does not segment as expected
#35 opened by Forest216 - 2
- 1
Can I use this from C or C++?
#25 opened by PhilAndrew - 1
Corpus python
#28 opened by reem1122-sys - 2
russian language
#29 opened by vinnitu - 1
Training on new, modern data.
#30 opened by sevmardi - 1
Please allow separation of numbers from text
#26 opened by prabhatM - 3
Text with numbers doesn't segment as expected
#20 opened by sgokhales - 1
`exhilarate` does not segment as expected
#21 opened by mooosu - 1
How to add custom values?
#16 opened by vebsun - 7
License question
#14 opened by kootenpv - 3
- 2
ZeroDivisionError
#13 opened by wcollins-ebsco - 2
import error
#18 opened by ffxz - 1
- 1
max() arg is an empty sequence
#15 opened by desh2608 - 5
Buffering issue in main()
#12 opened by dandelionred - 7
Recursion limit exceeded
#7 opened by ChristosChristofidis - 3
Bigram doesn't work.
#9 opened by moeseth - 1
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 1286: ordinal not in range(128)
#5 opened by chnsh - 4
Only Old word
#6 opened by aongwachi - 2
Training on new data
#2 opened by jagadeeshraja - 1
Prior probability calculation question
#1 opened by badc0re