OOXML part 2 A.1

Hi, I've no clue if that's still relevant and actual, but there
is apparently a bug in
<http://www.ecma-international.org/news/TC45_current_work/Office%20Open%20XML%20Part%202%20-%20Open%20Packaging%20Conventions%20(tagged).pdf>

| A.1 Creating an IRI from a Unicode String

| With reference to Arc [1-2] in Figure A–1, a Unicode string is
| converted to an IRI by percent-encoding each ASCII 16 character
| that does not belong to the set of reserved or unreserved 
| characters as defined in RFC 3986.

Percent-encoding controls, space, curly braces, angle brackets,
and "|" (all ASCII characters not permitted *anywhere* in an URI)
does not always yield an IRI.  Depending on their position in an
IRI "?", "#", square brackets, colon, slash, and others also have
to be percent-encoded.

Possibly A.1 is limited to some kind of "LEIRI", where that's no
issue, but as a general Unicode string to IRI recipe it's broken.

Apparently the W3C has a liaison to JTC 1 SC 34:
<http://www.itscj.ipsj.or.jp/sc34/open/lmember.htm>

IFF that bug is not fixed in the current version could the W3C 
please inform ISO about the syntax and semantics of IRIs ?

 Frank

Received on Sunday, 16 March 2008 21:36:13 UTC