- From: by way of Martin Duerst <chris@w3.org>
- Date: Sun, 06 Mar 2005 07:23:57 +0900
- To: public-iri@w3.org
On Saturday, March 5, 2005, 12:13:40 PM, Martin wrote: MD> Hello Chris, MD> Many thanks for your report. I have listed this issue as MD> http://www.w3.org/International/iri-edit#theseCharacters-102. MD> At 04:20 05/03/05, Chris Lilley wrote: >>Hello public-iri, >> >>In section 3.1 of RFC 3987, it says >> >> > Systems accepting IRIs MAY also deal with the printable characters in >> > US-ASCII that are not allowed in URIs, namely "<", ">", '"', space, >> > "{", "}", "|", "\", "^", and "`", in step 2 above. If these characters >> > are found but are not converted, then the conversion SHOULD fail. >> > Please note that the number sign ("#"), the percent sign ("%"), and >> > the square bracket characters ("[", "]") are not part of the above >> > list and MUST NOT be converted. Protocols and formats that have used >> > earlier definitions of IRIs including these characters >> >>In the third sentence, 'these characters' is ambiguous. i read it as >>referring to the most recently mentioned list - # % [ ] - but in fact (I >>confirmed with Martin) it refers to the first list. MD> It is very clear, at least to me, that the only thing that makes sense MD> is indeed that this refers to the first list. There are no protocols MD> or formats that have used "#", "%", "[", and "]". And escaping any of MD> them would wreck havoc on the IRI/URI itself. MD> But of course, the current text is at least misleading. I agree that, when the text is fully understood, only one interpretation makes sense. When the text is read i the process of gaining understanding, the ambiguity delays comprehension. >>Please reword to make this unambiguous. MD> Here is one way to reorder the sentences in the paragraph. MD> Not sure yet that this is the best version, however. I agree this is clearer. MD> Systems accepting IRIs MAY also deal with the printable characters in MD> US-ASCII that are not allowed in URIs, namely "<", ">", '"', space, MD> "{", "}", "|", "\", "^", and "`", in step 2 above. If these MD> characters are found but are not converted, then the conversion MD> SHOULD fail. Protocols and formats that have used earlier definitions MD> of IRIs including these characters MAY require percent-encoding of MD> these characters as a preprocessing step to extract the actual MD> IRI from a given field. This preprocessing MAY also be used by MD> applications allowing the user to enter an IRI. Please note that MD> the number sign ("#"), the percent sign ("%"), and the square MD> bracket characters ("[", "]") are not part of the above list MD> and MUST NOT be converted. MD> Regards, Martin. -- Chris Lilley mailto:chris@w3.org Chair, W3C SVG Working Group W3C Graphics Activity Lead
Received on Tuesday, 19 April 2005 10:05:32 UTC