W3C home > Mailing lists > Public > public-xml-core-wg@w3.org > January 2012

Re: Other possible issues with the Polyglot draft

From: Norman Walsh <ndw@nwalsh.com>
Date: Thu, 12 Jan 2012 06:22:45 -0800
To: public-xml-core-wg@w3.org
Message-ID: <m2hb016lvu.fsf@nwalsh.com>
"Henry S. Thompson" <ht@inf.ed.ac.uk> writes:
> So net-net I think we should ask for the following as the beginning of
> Section 3 of Polyglot [2]:
>
>    Polyglot markup uses the UTF-8 character encoding, the only character
>    encoding for which both HTML and XML require support. HTML requires
>    UTF-8 to be explicitly declared to avoid fallback to a legacy encoding
>    [HTML5]. For XML, UTF-8 is an encoding default. As such, character
>    encoding may be left undeclared in XML with the result that UTF-8 is
>    still supported [XML10].
>
>    Polyglot markup declares the UTF-8 character encoding in the following
>    ways, which may be used separately or in combination:
>
>    * Within the document
>      . By using <meta charset="UTF-8"/> (the HTML encoding
>         declaration) -- preferred
>      . By using the Byte Order Mark (BOM) character.
>
>    * Outside the document
>      . . .

+1

                                        Be seeing you,
                                          norm

-- 
Norman Walsh
Lead Engineer
MarkLogic Corporation
Phone: +1 413 624 6676
www.marklogic.com

Received on Thursday, 12 January 2012 14:23:23 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:16:43 UTC