- From: Felix Sasaki <fsasaki@w3.org>
- Date: Thu, 21 Jul 2005 11:49:29 +0900
- To: "www-international@w3.org" <www-international@w3.org>
------- Forwarded message -------
From: "Markus Scherer" <markus.scherer@us.ibm.com>
To: www-international@w3.org
Cc: "Mark Davis" <mark.davis@us.ibm.com>, "Uma Umamaheswaran" <umavs@ca.ibm.com>
Subject: [Moderator Action] comments on Using character entities and NCRs (markus)
Date: Thu, 21 Jul 2005 01:11:47 +0900

Title: Using character entities and NCRs
http://www.w3.org/International/questions/qa-escapes

Nice!

Please add the following:

1. Some web browsers send HTML form results using NCRs.

2. When converting XML or HTML files (or form results) to Unicode: First convert the text from its encoding to Unicode, then handle any markup, then un-escape NCRs and character entities.

3. Please list all of the character entities predefined by the XML Recommendation itself - it includes gt/lt/amp, does it include quot/apos and/or anything else?

4. Please list the code points in XML 1.0 and 1.1 that _must_ be represented with NCRs or entities, like C0 controls or just NUL.

5. Please make a stronger recommendation for using UTF-8.

Thanks,
markus

Markus Scherer マルクス
IBM GCoC-Unicode/ICU
San José, CA
⇀ http://ibm.com/software/globalization/icu ↼
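
The ordering in point 2 above (decode the bytes, then handle markup, then un-escape references) can be sketched roughly as follows. This is only an illustration using Python's standard library, not something taken from the article or from ICU; the names TextExtractor and extract_text are hypothetical, and the sketch assumes the source encoding is already known.

```python
from html import unescape
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects text content from HTML-like input."""

    def __init__(self):
        # convert_charrefs=False makes the parser report NCRs and entities
        # separately, so they are un-escaped only after tags are recognised.
        super().__init__(convert_charrefs=False)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)

    def handle_charref(self, name):
        # name is e.g. "233" or "xE9" for &#233; / &#xE9;
        self.parts.append(unescape(f"&#{name};"))

    def handle_entityref(self, name):
        # name is e.g. "lt" for &lt;
        self.parts.append(unescape(f"&{name};"))


def extract_text(raw: bytes, encoding: str) -> str:
    # Step 1: convert from the (assumed known) encoding to Unicode.
    text = raw.decode(encoding)
    # Steps 2 and 3: the parser handles markup first, then each reference
    # is un-escaped, so "&lt;b&gt;" becomes literal text, never a tag.
    parser = TextExtractor()
    parser.feed(text)
    parser.close()
    return "".join(parser.parts)


if __name__ == "__main__":
    raw = b"<p>caf&#233; &lt;b&gt; \xfc</p>"  # ISO-8859-1 bytes
    print(extract_text(raw, "iso-8859-1"))
```

Un-escaping before parsing would turn the escaped "&lt;b&gt;" into a real tag, which is presumably why the order matters.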
Received on Thursday, 21 July 2005 02:49:37 UTC