In HTML and XML, character encoding forms and character set (= Unicode) are decoupled. As a result, in any character encoding form, it is always possible to access the whole range of Unicode characters. For instance, with iso-8859-1 encoding form, I can encode any Unicode character by using NCR like ス for the Japanese character ス. [YA] Whilst every browser accepts this in the HTML, it is an Internet Explorer peculiarity to convert non-transcodable characters in forms *submissions* to NCRs. For instance, if I type Lad¥u00E9d¥u00E9 (where ¥u00E9 represents a lowercased “e” with acute accent) in a form expecting Shift_JIS, IE will send Lad&233;d&233; whilst Netscape may send for example Lad?d? where the “?” are the result of trying to convert ¥u00E9 to Shift_JIS. YAReceived on Monday, 1 October 2001 18:22:29 GMT
This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:57 GMT