Re: Some issues with the IRI document [legacyNFC-06] from Paul Hoffman / IMC on 2003-04-16 (public-iri@w3.org from April 2003)

From: Paul Hoffman / IMC <phoffman@imc.org>
Date: Tue, 15 Apr 2003 19:48:08 -0700
To: public-iri@w3.org
Message-Id: <p0521063bbac274df5fbe@[142.131.246.132]>

>At 08:14 03/04/08 -0700, Paul Hoffman / IMC wrote:
>
>>Technical issues:
>
>>I do not understand the logic of having Variants (B) and (C) in 
>>step 1 in section 3.1. One is normalized, the other one isn't. 
>>Doesn't this sound like a recipe for disaster? Why did you 
>>differentiate between these two cases?
>
>This is listed as issue
>http://www.w3.org/International/iri-edit/Overview.html#legacyNFC-06
>
>This is carefully based on the principle of early uniform normalization
>as described in the W3C Character Model. The assumption is that
>Unicode-based encodings are for the most part already in NFC
>(and where they are not, this may be on purpose). However,
>for non-Unicode encodings, normalization when converting is
>sometimes necessary (the most obvious example is windows-1258,
>for Vietnamese).

I guess this goes back to my question from the previous message: how 
do you know what encoding you are looking at?

--Paul Hoffman, Director
--Internet Mail Consortium

Received on Tuesday, 15 April 2003 22:48:19 UTC