W3C home > Mailing lists > Public > www-international@w3.org > January to March 2007

RE: [CSS3 Text] Japanese Iteration Marks

From: Richard Ishida <ishida@w3.org>
Date: Wed, 10 Jan 2007 13:18:40 -0000
To: "'fantasai'" <fantasai.lists@inkedblade.net>, <www-style@w3.org>, "'WWW International'" <www-international@w3.org>
Message-ID: <009101c734b9$d8f99d60$6501a8c0@rishida>

Hi Fantasai,

See below...

============
Richard Ishida
Internationalization Lead
W3C (World Wide Web Consortium)
 
http://www.w3.org/People/Ishida/
http://www.w3.org/International/
http://people.w3.org/rishida/blog/
http://www.flickr.com/photos/ishida/
 
 

> -----Original Message-----
> From: www-international-request@w3.org 
> [mailto:www-international-request@w3.org] On Behalf Of fantasai
> Sent: 10 January 2007 07:05
...

> One of the differences between strict and loose is allowing 
> breaks before Japanese iteration marks. I have read in 
> various sources that when there's a break before an iteration 
> mark, the iteration mark should be replaced with the 
> appropriate character. Wikipedia describes and illustrates 
> this behavior here:
> 
>    http://en.wikipedia.org/wiki/Iteration_mark
>    http://en.wikipedia.org/wiki/Reference_mark#Repetition_marks

Actually, I can't find a description of line-breaking behaviour in these
WikiPedia articles.


> 
> I am currently speccing this in CSS3 Text. What I have so far is
> 
>     | If the UA breaks before a Japanese iteration mark, it 
> should* replace
>     | the iteration mark the character represented by the 
> iteration mark:

Perhaps you mean:
"If the UA breaks a line before a Japanese iteration mark, it should replace
the iteration mark on the new line with the characters it represents, as
follows:"

>     |
>     |   * the corresponding Han character, for IDEOGRAPHIC ITERATION
>     |     MARK (U+3005) and VERTICAL IDEOGRAPHIC ITERATION 
> MARK (U+303B)
>     |   * the (appropriately-voiced) corresponding Hiragana character,
>     |     for HIRAGANA ITERATION MARK (U+309D) and HIRAGANA 
> VOICED ITERATION
>     |     MARK (U+309E)
>     |   * the (appropriately-voiced) corresponding Katakana character,
>     |     for KATAKANA ITERATION MARK (U+30FD) and KATAKANA 
> VOICED ITERATION
>     |     MARK (U+30FE)
>     |

What if the line break occurs after the first of three consecutive iteration
marks, eg:
????
??
Ie. 
tokoroIm
ImIm

Does the first IM get converted to ? (do) as well as the others, or just the
ones on the new line?

I'm assuming that multiple Ims follow the same rule, in which case the
wording above should probably say 'before one or more Japanese iteration
marks'.


>     | A sequence of <var>N</var> iteration marks corresponds 
> to the sequence
>     | of <var>N</var> characters that immediately precedes 
> it. If an iteration
>     | mark does not correspond to a character of the matching 
> script type,
>     | then the iteration mark must not be replaced.

Is this last rule to deal with the situation where a page is describing
iteration marks, and includes examples in the text surrounded by some type
of quote marks, or such?

RI

-- 
No virus found in this outgoing message.
Checked by AVG Free Edition.
Version: 7.5.432 / Virus Database: 268.16.8/621 - Release Date: 09/01/2007
13:37
 
Received on Wednesday, 10 January 2007 13:17:30 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:09 GMT