pdf-raku/PDF-raku

Text line/word breaking with respect to Unicode Standard Annex #14

Closed this issue · 2 comments

Unicode Standard Annex #14 (http://unicode.org/reports/tr14/) contains useful information that could improve and generalize word/line breaking in the $.page.text() method.

Without going overboard, the text breaking method could make use of the non-breaking classes and break opportunities, and handle numeric context better.
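To make the suggestion concrete, here is a rough sketch of the idea, written in Python purely for illustration. It is not PDF-raku code and covers only a tiny, conservative subset of Unicode Standard Annex #14. The function name `break_opportunities` and the `NON_BREAKING` set are hypothetical, and the rules are simplified: break after spaces, break after a hyphen unless a digit follows (numeric context), and never break next to a non-breaking character.

```python
# Illustrative sketch only: a small, conservative subset of the ideas in
# Unicode Standard Annex #14, not the full algorithm and not PDF-raku code.

# Hypothetical set of "glue" characters we never break next to:
# NO-BREAK SPACE and NARROW NO-BREAK SPACE.
NON_BREAKING = {"\u00A0", "\u202F"}


def break_opportunities(text):
    """Return the indices i at which a line break is allowed before text[i]."""
    breaks = []
    for i in range(1, len(text)):
        prev, cur = text[i - 1], text[i]
        # Non-breaking classes: conservatively forbid a break on either side.
        if prev in NON_BREAKING or cur in NON_BREAKING:
            continue
        # Ordinary break opportunity: after a run of spaces.
        if prev == " " and cur != " ":
            breaks.append(i)
        # Hyphen: allow a break after it, except in numeric context
        # (e.g. keep "-5" together) or when a space follows.
        elif prev == "-" and cur != " " and not cur.isdigit():
            breaks.append(i)
    return breaks


if __name__ == "__main__":
    for sample in ["foo bar", "10\u00A0km away", "well-known", "range -5 to"]:
        print(repr(sample), break_opportunities(sample))
```

A real implementation would look up each character's line breaking class from the Unicode data rather than hard-coding a few characters.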

Text handling has been exported to upstream PDF::DOM. Will reopen there.

Err, David, isn't that downstream?