- From: Norman Walsh <ndw@nwalsh.com>
- Date: Thu, 12 Jan 2012 06:22:45 -0800
- To: public-xml-core-wg@w3.org
- Message-ID: <m2hb016lvu.fsf@nwalsh.com>
"Henry S. Thompson" <ht@inf.ed.ac.uk> writes:
> So net-net I think we should ask for the following as the beginning of
> Section 3 of Polyglot [2]:
>
> Polyglot markup uses the UTF-8 character encoding, the only character
> encoding for which both HTML and XML require support. HTML requires
> UTF-8 to be explicitly declared to avoid fallback to a legacy encoding
> [HTML5]. For XML, UTF-8 is an encoding default. As such, character
> encoding may be left undeclared in XML with the result that UTF-8 is
> still supported [XML10].
>
> Polyglot markup declares the UTF-8 character encoding in the following
> ways, which may be used separately or in combination:
>
> * Within the document
> . By using <meta charset="UTF-8"/> (the HTML encoding
> declaration) -- preferred
> . By using the Byte Order Mark (BOM) character.
>
> * Outside the document
> . . .
+1
Be seeing you,
norm
--
Norman Walsh
Lead Engineer
MarkLogic Corporation
Phone: +1 413 624 6676
www.marklogic.com
Received on Thursday, 12 January 2012 14:23:23 UTC