W3C home > Mailing lists > Public > public-clreq-admin@w3.org > July to September 2015

RE: Putting Word-breaking in CLReq?

From: HU, Chunming <hucm@w3.org>
Date: Fri, 24 Jul 2015 20:33:33 +0800
To: "'Xiaoqian Wu'" <xiaoqian@w3.org>, <public-clreq-admin@w3.org>
Message-ID: <00c801d0c60c$f858a2d0$e909e870$@w3.org>

? really?



From: public-clreq-admin-request+bounce-hucm=w3.org@listhub.w3.org [mailto:public-clreq-admin-request+bounce-hucm=w3.org@listhub.w3.org] On Behalf Of Xiaoqian Wu
Sent: Friday, July 24, 2015 6:02 PM
To: public-clreq-admin@w3.org
Subject: Putting Word-breaking in CLReq?


In case I forget about this in the next meeting, here’s a request about word-breaking and the relevant discussion. Word breaking is important for the Selection and Editing APIs. Shall we provide some brief answers to this topic in the CLReq?


Q: Does anyone know of character level mechanisms used to advise alogrithms of the word boundaries (or lack of boundaries) in Chinese text?



From: Li Songfeng

中文正文断词除了标点不能位于行首以及单字不成行(一个字不能占一行)、孤行控制(分页情况下,一段第一行出现在页尾或最后一行出现在页首 )外,就想不起来其他规则了。中西文、数字混排会更复杂。中文标题如果太长需要折行,的确有构词的问题,比如“……的……”中的“的”不能出现在下一行行首。


From: Zhang Kun






Received on Friday, 24 July 2015 12:33:50 UTC

This archive was generated by hypermail 2.3.1 : Friday, 24 July 2015 12:33:50 UTC