- From: Frank Ellermann <nobody@xyzzy.claranet.de>
- Date: Sun, 16 Mar 2008 22:37:57 +0100
- To: public-iri@w3.org
Hi, I've no clue if that's still relevant and actual, but there is apparently a bug in <http://www.ecma-international.org/news/TC45_current_work/Office%20Open%20XML%20Part%202%20-%20Open%20Packaging%20Conventions%20(tagged).pdf> | A.1 Creating an IRI from a Unicode String | With reference to Arc [1-2] in Figure A–1, a Unicode string is | converted to an IRI by percent-encoding each ASCII 16 character | that does not belong to the set of reserved or unreserved | characters as defined in RFC 3986. Percent-encoding controls, space, curly braces, angle brackets, and "|" (all ASCII characters not permitted *anywhere* in an URI) does not always yield an IRI. Depending on their position in an IRI "?", "#", square brackets, colon, slash, and others also have to be percent-encoded. Possibly A.1 is limited to some kind of "LEIRI", where that's no issue, but as a general Unicode string to IRI recipe it's broken. Apparently the W3C has a liaison to JTC 1 SC 34: <http://www.itscj.ipsj.or.jp/sc34/open/lmember.htm> IFF that bug is not fixed in the current version could the W3C please inform ISO about the syntax and semantics of IRIs ? Frank
Received on Sunday, 16 March 2008 21:36:13 UTC