comments on Using character entities and NCRs (markus scherer)

------- Forwarded message -------
From: "Markus Scherer" <markus.scherer@us.ibm.com>
To: www-international@w3.org
Cc: "Mark Davis" <mark.davis@us.ibm.com>, "Uma Umamaheswaran"  
<umavs@ca.ibm.com>
Subject: [Moderator Action] comments on Using character entities and NCRs  
(markus)
Date: Thu, 21 Jul 2005 01:11:47 +0900

Title: Using character entities and NCRs
http://www.w3.org/International/questions/qa-escapes

Nice!

Please add the following:
1. Some web browsers send HTML form results using NCRs.
2. When converting XML or HTML files (or form results) to Unicode: First
convert the text from its encoding to Unicode, then handle any markup,
then un-escape NCRs and character entities.
3. Please list all of the character entities predefined by the XML
Recommendation itself - it includes gt/lt/amp, does it include quot/apos
and/or anything else?
4. Please list the code points in XML 1.0 and 1.1 that _must_ be
represented with NCRs or entities, like C0 controls or just NUL.
5. Please make a stronger recommendation for using UTF-8.

Thanks,
markus

Markus Scherer  マルクス  IBM GCoC-Unicode/ICU  San José, CA ⇀
http://ibm.com/software/globalization/icu

Received on Thursday, 21 July 2005 02:49:37 UTC