- From: Martin J. Dürst <duerst@it.aoyama.ac.jp>
- Date: Sat, 23 Oct 2010 07:19:29 +0900
- To: "Phillips, Addison" <addison@lab126.com>
- CC: "public-iri@w3.org" <public-iri@w3.org>, Bjoern Hoehrmann <derhoermi@gmx.net>
On 2010/10/23 6:40, Phillips, Addison wrote:
> (co-chair hat OFF) [editor hat on]
> Trac issue: http://trac.tools.ietf.org/wg/iri/trac/ticket/23
>
> So here is a slightly different version of the same proposal. Bjoern, does this suitably allay your concerns? I recognize that you would prefer us to eliminate the conversion step altogether, but some feel that the conversion step is not altogether obvious, even if it is intrinsic to what follows.
> ---
> An IRI or IRI reference is a sequence of characters from the UCS. For IRIs that are not already encoded in Unicode (as when written on paper, read aloud, or represented in a text stream using a legacy character encoding), convert the IRI to Unicode. Note that some character encodings or transcriptions can be converted to or represented by more than one sequence of Unicode characters. Ideally the resulting IRI would use a normalized form, such as Unicode Normalization Form C [UAX15] (see [Section 5] Normalization and Comparison), since that ensures a stable, consistent representation that is most likely to produce the intended results. Implementers and users are cautioned that, while denormalized character sequences are valid, they might be difficult for other users or processes to reproduce and might lead to unexpected results.
> ---

I have implemented this change as http://trac.tools.ietf.org/wg/iri/trac/changeset/22/, with a slight tweak to align it with previous paragraph.

Please feel free to propose further changes if necessary (exact text strongly preferred).

Regards, Martin.

-- 
#-# Martin J. Dürst, Professor, Aoyama Gakuin University
#-# http://www.sw.it.aoyama.ac.jp mailto:duerst@it.aoyama.ac.jp
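
For illustration only, the conversion and normalization step described in the quoted text might be sketched as below. This is a minimal sketch using Python's standard `unicodedata` module, not text from the draft; the helper name `to_unicode_nfc` and the example.org IRIs are hypothetical, and ISO-8859-1 stands in for any legacy character encoding.

```python
import unicodedata


def to_unicode_nfc(raw: bytes, encoding: str) -> str:
    """Hypothetical helper: decode a legacy-encoded IRI, then apply NFC."""
    # Step 1: convert the IRI from its legacy encoding to Unicode.
    text = raw.decode(encoding)
    # Step 2: normalize to Unicode Normalization Form C.
    return unicodedata.normalize("NFC", text)


# Two Unicode character sequences for the same visible IRI (hypothetical):
# precomposed U+00E9 versus "e" followed by combining acute U+0301.
composed = "http://example.org/r\u00e9sum\u00e9"
decomposed = "http://example.org/re\u0301sume\u0301"

# The denormalized form is valid but does not compare equal as-is.
differ_before = composed != decomposed

# After NFC normalization the two sequences become identical.
equal_after = unicodedata.normalize("NFC", decomposed) == composed

# An IRI arriving in a legacy encoding (ISO-8859-1 assumed here).
legacy_bytes = composed.encode("iso-8859-1")
converted = to_unicode_nfc(legacy_bytes, "iso-8859-1")
```

The comparison before and after normalization shows why a denormalized sequence, although valid, can be hard for other users or processes to reproduce.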
Received on Friday, 22 October 2010 22:20:31 UTC