Re: Percent encoding

On Tue, 2 Mar 2010, Raphaël Troncy wrote:

> Dear Philip,
>
>> Perhaps YouTube decodes first and splits last, or perhaps they just use
>> a regexp to find v=XXXXX anywhere. Whatever is the case with YouTube, I
>> assume we want to match as closely as possible how query strings works
>> in e.g. ASP, PHP, JSP and Perl CGI, or there is no benefit in using
>> something that resembles query strings.
>> 
>> We can never be 100% compatible, for reasons listed in a note after
>> http://www.w3.org/2008/WebVideo/Fragments/WD-media-fragments-spec/#decode-a-percent-encoded-string
>
> Thanks, the note is indeed really useful. For all the following statements, 
> do you think it is possible to indicate a suitable reference?
>    *  "&" is the only primary separator for name-value pairs, but some 
> server-side languages also treat ";" as a separator.
>    * name-value pairs with invalid percent-encoding should be ignored, but 
> some server-side languages silently mask such errors.
>    * The "+" character should not be treated specially, but some server-side 
> languages replace it with a space (" ") character.

+ is in sub-delims, along with & ; and others
(cf rfc3986)

>    * Multiple occurrences of the same name must be preserved, but some 
> server-side languages only preserve the last occurrence.
>
> Best regards.
>
>  Raphaël
>
>

-- 
Baroula que barouleras, au tiéu toujou t'entourneras.

         ~~Yves

Received on Wednesday, 3 March 2010 13:31:10 UTC