W3C home > Mailing lists > Public > public-iri@w3.org > July 2011

RE: reviewing draft-weber-iri-guidelines-00

From: Phillips, Addison <addison@lab126.com>
Date: Tue, 5 Jul 2011 17:33:04 -0700
To: Mark Davis ☕ <mark@macchiato.com>
CC: Chris Weber <chris@lookout.net>, "PUBLIC-IRI@W3.ORG" <PUBLIC-IRI@w3.org>
Message-ID: <131F80DEA635F044946897AFDA9AC3476A94126B1A@EX-SEA31-D.ant.amazon.com>
   4.  Replace each entity references with its corresponding character.

This can't be done until after the fields of an IRI are parsed out. Example: in a path, you don't want an escaped / or # or ? to be transformed until after you've parsed out the path.

This is a catch-22 situation. Full normalization (which, as noted, we agree, should not be applied) would depend on unescaping the characters, whereas full unescaping, as you note, may break things. Perhaps this wants to be “unescape iunreserved characters”?

Addison


Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)

Internationalization is not a feature.
It is an architecture.
Received on Wednesday, 6 July 2011 00:33:43 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:14:42 UTC