nipunsadvilkar/pySBD

"unbalanced parenthesis" error

dumitrescustefan opened this issue · 0 comments

I get this error intermittently, for example with the following text:

'Remuneration Report. 9.8.4(5) Directors? (the Company) See paragraph headed ?Capital structure? in this report. 9.8.4(8) Non-pro-rata allotments of equity for cash (major subsidiaries) N/A 9.8.4(10) Contracts of significance involving a Director N/A 9.8.4(11) Contracts of significance involving a controlling shareholder N/A 9.8.4(12) Waivers of dividends N/A 9.8.4(13) Waivers of future dividends N/A 9.8.4(14) Agreement with a controlling shareholder (LR 9.2.2.AR(2)(a)) See Corporate'

I get: unbalanced parenthesis at position 14
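
For context, the message has the same wording as Python's `re.error`, which the standard `re` module raises when a pattern contains a ")" with no matching "(". One possible cause, not confirmed against pySBD's source, is that a piece of the input text ends up inside a regular expression without being escaped. The sketch below only illustrates that general failure mode with the standard library. The `fragment` string and the `try_compile` helper are hypothetical and are not taken from pySBD.

```python
import re


def try_compile(pattern: str):
    """Return (compiled_pattern, None) on success or (None, message) on re.error."""
    try:
        return re.compile(pattern), None
    except re.error as exc:
        return None, str(exc)


# Hypothetical fragment, chosen only because it has one more ")" than "(".
fragment = "9.2.2.AR(2)(a))"

# Used verbatim as a pattern, the trailing ")" has no group to close.
_, raw_error = try_compile(fragment)

# re.escape() makes every metacharacter literal, so compilation succeeds.
escaped_pattern, escaped_error = try_compile(re.escape(fragment))

print(raw_error)      # an "unbalanced parenthesis" message
print(escaped_error)  # None
```

If this is what happens inside the library, escaping the text before building the pattern would be the usual fix, but that is a guess until the traceback is inspected.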

This is how I instantiate and run:
import pysbd

segmenter = pysbd.Segmenter(language="en", clean=True, doc_type="pdf")
sentences = segmenter.segment(txt)

where txt is the string above.
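
Until the cause is found, a possible stopgap is to catch the exception around the call and fall back to returning the text unsplit. This is a minimal sketch under the assumption that the error surfaces as `re.error`, which the message suggests but which has not been verified. `safe_segment` is a hypothetical helper, not part of pySBD, and it accepts any callable that maps a string to a list of sentences.

```python
import re
from typing import Callable, List


def safe_segment(segment: Callable[[str], List[str]], text: str) -> List[str]:
    """Call segment(text); if it raises re.error, return the text as one item."""
    try:
        return segment(text)
    except re.error:
        # Assumption: the failure is a regex compile error. Keep the input
        # as a single unsplit item instead of aborting the whole run.
        return [text]
```

With the segmenter above this would be called as `safe_segment(segmenter.segment, txt)`. It hides the symptom only; affected texts come back as a single unsplit string.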
Thanks!