- From: Mark Nottingham <mnot@mnot.net>
- Date: Wed, 17 Sep 2008 09:20:48 +1000
- To: John Cowan <cowan@ccil.org>
- Cc: URI <uri@w3.org>, Joe Gregorio <joe@bitworking.org>, David Orchard <orchard@pacificspirit.com>, Marc Hadley <Marc.Hadley@Sun.COM>
'required' is a bit strong; the relevant text in section 2.4 (below) at most implies that they can be treated as equivalent by implementations that choose to. > When a URI is dereferenced, the components and subcomponents > significant to the scheme-specific dereferencing process (if any) > must be parsed and separated before the percent-encoded octets > within those components can be safely decoded, as otherwise the data > may be mistaken for component delimiters. The only exception is for > percent-encoded octets corresponding to characters in the unreserved > set, which can be decoded at any time. For example, the octet > corresponding to the tilde ("~") character is often encoded as "%7E" > by older URI processing implementations; the "%7E" can be replaced > by "~" without changing its interpretation. On 17/09/2008, at 8:13 AM, John Cowan wrote: > Mark Nottingham scripsit: > >> There are just too many cases where the 'escape everything but >> unreserved' rule gets in the way; for example, if my template is >> "http://example.com/user/ {email}", I'm going to have percent- >> encoded @ >> signs in my URIs whether I like it or not -- even though they're not >> required to be percent- encoded there. > > It doesn't matter much, though, because anyone who decodes the URI > is required to treat %40 and @ synonymously in that position. > > -- > John Cowan > cowan@ccil.org > I am a member of a civilization. --David Brin -- Mark Nottingham http://www.mnot.net/
Received on Tuesday, 16 September 2008 23:21:28 UTC