W3C home > Mailing lists > Public > public-html-commits@w3.org > June 2009

html5/spec Overview.html,1.2471,1.2472

From: Ian Hickson via cvs-syncmail <cvsmail@w3.org>
Date: Sun, 28 Jun 2009 10:53:52 +0000
To: public-html-commits@w3.org
Message-Id: <E1MKs1B-0001OP-3l@lionel-hutz.w3.org>
Update of /sources/public/html5/spec
In directory hutz:/tmp/cvs-serv5327

Modified Files:
	Overview.html 
Log Message:
Elaborate on the rules for ASCII-compatible encodings (see last checkin). (credit: pt) (whatwg r3332)

Index: Overview.html
===================================================================
RCS file: /sources/public/html5/spec/Overview.html,v
retrieving revision 1.2471
retrieving revision 1.2472
diff -u -d -r1.2471 -r1.2472
--- Overview.html	28 Jun 2009 10:10:40 -0000	1.2471
+++ Overview.html	28 Jun 2009 10:53:50 -0000	1.2472
@@ -1540,11 +1540,11 @@
   the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
   0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
   character sets we want to support do things outside that range?
-  -->, ignoring cases where those bytes would be part of multibyte
-  sequences. <a href="#references">[RFC1345]</a><p class="note">This includes such exotic encodings as Shift_JIS and
+  -->, ignoring the second and later bytes of multibyte sequences. <a href="#references">[RFC1345]</a><p class="note">This includes such exotic encodings as Shift_JIS and
   variants of ISO-2022, even though it is possible for bytes like 0x70
   to be part of longer sequences that are unrelated to their
-  interpretation as ASCII.</p><!--
+  interpretation as ASCII. It excludes such encodings as UTF-7,
+  UTF-16, HZ-GB-2312, GSM03.38, and EBCDIC variants.</p><!--
    We'll have to change that if anyone comes up with a way to have a
    document that is valid as two different encodings at once, with
    different <meta charset> elements applying in each case.
Received on Sunday, 28 June 2009 10:54:02 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Sunday, 28 June 2009 10:54:04 GMT