W3C home > Mailing lists > Public > www-validator@w3.org > February 2003

autodetecting character encoding

From: Darin McGrew <mcgrew@stanfordalumni.org>
Date: Fri, 07 Feb 2003 18:14:14 -0800
To: www-validator@w3.org
Message-Id: <20030208021415.51F712B1D3@mauve-new.rahul.net>

Appendix F of the XML 1.0 Recommendation specifies ways to autodetect the
character encoding of XML documents, and this works fine for documents that
start with the four bytes 3C 3F 78 6D ("<?xm"). Maybe we need a similar
mechanism for valid HTML documents, documents that start with the four
bytes 3C 21 44 4F ("<!DO").

This would make it a lot easier to upload documents to the validator, when
the documents will have their character encoding specified via the HTTP
Content-Type field when retrieved from the server.

Darin McGrew, mcgrew@stanfordalumni.org, http://www.rahul.net/mcgrew/
    Web Design Group, darin@htmlhelp.com, http://www.HTMLHelp.com/

key ring /'kE 'ri[ng]/ n. device enabling simultaneous loss of multiple keys
Received on Friday, 7 February 2003 21:17:11 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 25 April 2012 12:14:05 GMT