Divide section 3

Question

Divide section 3

r12a opened this issue 8 years ago · 3 comments

Text segmentation
http://w3c.github.io/ilreq/#h_text_segmentation

i think section 3 could be divided into two subsections:

word boundaries
typographic units

Answer 1 · 2017-01-22T19:54:35.000Z

Agree. And I found it confusing to see certain content here (mentioning Unicode code points, characters, extended grapheme clusters) seemingly should belong to section 2 "Indic orthographic syllable boundaries".

There's even such inaccurate and totally duplicating pieces:

A syllable includes a base consonant and any combination of the following characters in the text stream:

sequences of consonants preceded by virama (i.e. conjuncts).

vowel signs

visarga, anusvara or candrabindu.

Answer 2 · 2017-04-24T15:10:26.000Z

@lianghai's point seems to have been fixed in 33e2c15

(when a change is made to the document that fixes a particular issue, it would be helpful to say so in the comment that comes with that commit (including an link to the issue) - it took me a while to figure out why i couldn't find that text, and then to check where it was removed)

The division into subsections is still tbd.

Answer 3 · 2017-09-21T06:49:12.000Z

The section 3 has been divided into two subsections.