W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > February 2012

[Bug 16106] Clarify paragraph about character references in tokenization.html

From: <bugzilla@jessica.w3.org>
Date: Tue, 28 Feb 2012 10:59:06 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1S2Klu-0000sk-4F@jessica.w3.org>
https://www.w3.org/Bugs/Public/show_bug.cgi?id=16106

--- Comment #3 from Ezio Melotti <ezio.melotti@gmail.com> 2012-02-28 10:59:04 UTC ---
One solution would be to use a list like the in the rest of the page, so
something like:
...
→ 0xD800 to 0xDFFF
→ greater than 0x10FFFF
    Parse error.  Return U+FFFD.
→ 0x0001 to 0x0008
→ 0x000E to 0x001F
→ ...
    Parse error. Treat it as per the "anything else" entry below.
→ Anything else
    Return a character token for the Unicode character whose code point is that
number.

-- 
Configure bugmail: https://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Tuesday, 28 February 2012 10:59:10 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 28 February 2012 10:59:14 GMT