W3C home > Mailing lists > Public > www-international@w3.org > July to September 2005

comments on Using character entities and NCRs (markus scherer)

From: Felix Sasaki <fsasaki@w3.org>
Date: Thu, 21 Jul 2005 11:49:29 +0900
Message-ID: <op.st8ssrapx1753t@ibm-60d333fc0ec.w3.mag.keio.ac.jp>
To: "www-international@w3.org" <www-international@w3.org>



------- Forwarded message -------
From: "Markus Scherer" <markus.scherer@us.ibm.com>
To: www-international@w3.org
Cc: "Mark Davis" <mark.davis@us.ibm.com>, "Uma Umamaheswaran"  
<umavs@ca.ibm.com>
Subject: [Moderator Action] comments on Using character entities and NCRs  
(markus)
Date: Thu, 21 Jul 2005 01:11:47 +0900

Title: Using character entities and NCRs
http://www.w3.org/International/questions/qa-escapes

Nice!

Please add the following:
1. Some web browsers send HTML form results using NCRs.
2. When converting XML or HTML files (or form results) to Unicode: First
convert the text from its encoding to Unicode, then handle any markup,
then un-escape NCRs and character entities.
3. Please list all of the character entities predefined by the XML
Recommendation itself - it includes gt/lt/amp, does it include quot/apos
and/or anything else?
4. Please list the code points in XML 1.0 and 1.1 that _must_ be
represented with NCRs or entities, like C0 controls or just NUL.
5. Please make a stronger recommendation for using UTF-8.

Thanks,
markus

Markus Scherer  マルクス  IBM GCoC-Unicode/ICU  San José, CA ⇀
http://ibm.com/software/globalization/icu
Received on Thursday, 21 July 2005 02:49:37 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:05 GMT