- From: Thomas Roessler <tlr@w3.org>
- Date: Wed, 2 Sep 2009 19:32:03 +0200
- To: Larry Masinter <masinter@adobe.com>
- Cc: "PUBLIC-IRI@W3.ORG" <PUBLIC-IRI@w3.org>
On 2 Sep 2009, at 19:11, Larry Masinter wrote: > I'm still working on a draft that turns the MAY into a MUST for > ireg-name processing; it winds up rewriting a lot of the > document because it puts parsing before percent-encoding. > > I'd rather wait to discuss this until I have a draft ready > (had hoped to finish yesterday). > > One section I've stumbled on is: > > > Systems accepting IRIs MAY convert the ireg-name component of an IRI > as follows (before step 2 above) for schemes known to use domain > names in ireg-name, if the scheme definition does not allow percent- > encoding for ireg-name: Replace the ireg-name part of the IRI by the > part converted using the ToASCII operation specified in Section 4.1 > of [RFC3490] on each dot-separated label, and by using U+002E (FULL > STOP) as a label separator, with the flag UseSTD3ASCIIRules set to > TRUE, Another point related to yours: UseSTD3ASCIIRules should be FALSE here. Those are rules on the *registration* of domain names, and I don't see what they have to do in a specification that effectively deals with resolution. From a quick check using "_test0_α.does-not-exist.org" as a test case, it seems like at least the latest Safari and Firefox don't set that flag when trying to resolve an IRI reference. I did some archeology on the topic in March; the genesis of UseSTD3ASCIIRules being TRUE goes back to this note from Martin: http://www.imc.org/idn/mail-archive/msg07277.html ... which seems to be mistaken about the intent of some of the text in the original URI spec. > > and with the flag AllowUnassigned set to FALSE for creating > IRIs and set to TRUE otherwise. The ToASCII operation may fail, > but > this would mean that the IRI cannot be resolved. This conversion > SHOULD be used when the goal is to maximize interoperability with > legacy URI resolvers. For example, the IRI > "http://résumé.example.org" > may be converted to > "http://xn--rsum-bpad.example.org" > instead of > "http://r%C3%A9sum%C3%A9.example.org". > > > Can someone explain the AllowedUnassigned set to FALSE for "creating > IRIs"? This is in the middle of the algorithm for converting IRIs > (which is turning into converting 'parsed IRI components' into > 'parsed URI components'), but what is the applicability of > 'creating IRIs' when doing this mapping anyway? I'd think none, i.e., AllowUnassigned should be TRUE in this spot, for the very reason that you mention.
Received on Wednesday, 2 September 2009 17:32:16 UTC