Shouldn't colons cause a sentence split?
RuABraun opened this issue · 2 comments
RuABraun commented
They currently don't:
>>> s = 'Tomorrow I will do the greatest thing ever: Become a god.'
>>> seg.segment(s)
['Tomorrow I will do the greatest thing ever: Become a god.']
>>> s = 'The best player of the city: Zob Ahan F.C. and Sepahan F.C..'
>>> seg.segment(s)
['The best player of the city: Zob Ahan F.C. and Sepahan F.C..']
nipunsadvilkar commented
@RuABraun: not by default design choice of pysbd
& pragmatic_segmenter
. Still, if you wish to add :
then update it at the of SENTENCE_BOUNDARY_REGEX
- \S.*?[。..:!!??ȸȹ☉☈☇☄]
.
pySBD/pysbd/lang/common/numbers.py
Line 10 in f3d0ae4
RuABraun commented
Thanks