RE: i18n Polyglot Markup/NCRs (7th issue)

From: Eliot Graff <eliotgra@microsoft.com>
Date: Wed, 29 Sep 2010 22:48:11 +0000
To: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>, Henri Sivonen <hsivonen@iki.fi>
CC: public-html <public-html@w3.org>, "public-i18n-core@w3.org" <public-i18n-core@w3.org>
Message-ID: <CE3A5BFD1228D84A8D9C158EEC195FD50EBB3D4F@TK5EX14MBXW601.wingroup.windeploy.ntdev.microsoft.com>
Per bug 10154 [1], I have edited the section so that it now reads as follows:

]]
8. Named Entity References

Polyglot markup uses only the following named entity references:

    * amp
    * lt
    * gt
    * apos
    * quot

For entities beyond the previous list, a polyglot document uses character references. For example, polyglot markup uses &#xA0; instead of &nbsp;. Note that polyglot markup may use decimal values for escape characters (such as &#160; in the previous example); however, the Character Model for the World Wide Web states that content should use the hexadecimal form of character escapes rather than the decimal form when there are both. [CHARMOD]
[[
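
For illustration only (this is not part of the spec text), the substitution the section describes could be sketched in Python roughly as follows. The names to_polyglot_refs and POLYGLOT_SAFE are hypothetical, and the sketch assumes Python's html.entities.name2codepoint table, which covers only the HTML 4 entity names. It rewrites matching references anywhere in the string, so it makes no attempt to skip CDATA sections, comments, or script content.

```python
import re
from html.entities import name2codepoint

# The five named entity references the section above allows (hypothetical name).
POLYGLOT_SAFE = {"amp", "lt", "gt", "apos", "quot"}

# Matches a named entity reference such as &nbsp; or &eacute;
_ENTITY = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def to_polyglot_refs(text: str) -> str:
    """Replace named entity references outside the allowed five
    with hexadecimal character references."""

    def replace(match: "re.Match[str]") -> str:
        name = match.group(1)
        # Leave the five allowed names, and any name this table
        # does not know, exactly as they were written.
        if name in POLYGLOT_SAFE or name not in name2codepoint:
            return match.group(0)
        return "&#x{:X};".format(name2codepoint[name])

    return _ENTITY.sub(replace, text)
```

Under these assumptions, to_polyglot_refs("a&nbsp;b &amp; c") leaves &amp; alone and turns &nbsp; into &#xA0;. Emitting the hexadecimal form follows the CHARMOD recommendation quoted above; a decimal variant would be equally valid polyglot markup.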

I believe that this satisfies the original request while also indicating that polyglot markup may use either hexadecimal or decimal character references.

Can someone please edit the information on the Internationalization Comments on Polyglot Markup: HTML-Compatible XHTML Documents [2] to indicate that this change has been made?

Thanks for your help and patience.

Eliot

[1] http://www.w3.org/Bugs/Public/show_bug.cgi?id=10154
[2] http://www.w3.org/International/reviews/1007-polyglot/

-----Original Message-----
From: Leif Halvard Silli [mailto:xn--mlform-iua@målform.no] 
Sent: Tuesday, July 20, 2010 9:58 PM
To: Henri Sivonen
Cc: public-html; Eliot Graff; public-i18n-core@w3.org
Subject: Re: i18n Polyglot Markup/NCRs (7th issue)

Henri Sivonen, Mon, 19 Jul 2010 06:35:02 -0700 (PDT):
> Leif wrote:
>> First of all, my comment was to Richard, who suggested that Polyglot 
>> markup should "favor" hexadecimal NCRs.
> 
> I think neither decimal nor hexadecimal can be preferred over the 
> other on polyglot grounds, so the publication shouldn't prefer one 
> over the other.

Such was my initial reaction as well, and I prefer that direction. Nor is it the purpose of Polyglot Markup to permit, in documents served as HTML, things that text/html does not itself allow.

(For a reply to the document encoding issues, see separate reply.)
--
leif halvard silli
Received on Wednesday, 29 September 2010 22:52:11 GMT