Words and spaces

Hello public-tt,

In section 8.3.7 <flowFunction>

 The dynamic flow unit word must be interpreted as being dependent upon
 the language or writing system of the affected content. If the language
 or writing system is unknown or unspecified, then word is interpreted
 as follows:

   1. If the affected content consists solely or mostly of Unified CJK
   Ideographic characters or of characters of another Unicode character
   block that are afforded similar treatment to that of Unified CJK
   Ideographic characters, then word is to be interpreted as if
   character were specified.
   
   2. Otherwise, word is to be interpreted as denoting a sequence of one
   or more characters that are not interpreted as an XML whitespace
   character.

Noting the "must" which is a testable conformance requirement, do the
following paragraphs contain one word or two?

<p>Hello&#x3000;World</p>
<p xml:lang="en">Hello&#x3000;World</p>
<p xml:lang="en">Hello&#x2004;World</p>
<p xml:lang="ja">Hello&#x3000;World</p>
<p xml:lang="ja">Hello&#x2004;World</p>
<p xml:lang="ja">Masayasu Ishikawa</p>

For a list of Unicode space characters, see for example
http://www.cs.tut.fi/~jkorpela/chars/spaces.html


-- 
 Chris Lilley                    mailto:chris@w3.org
 Interaction Domain Leader
 Co-Chair, W3C SVG Working Group
 W3C Graphics Activity Lead
 Co-Chair, W3C Hypertext CG

Received on Friday, 2 June 2006 19:04:25 UTC