Re: Posted draft of URI comparison finding

>> That hardly matters.  Section 2.1 says that %7A and %7a both
>> represent the same octet, and therefore are guaranteed to be the same
>> character regardless of character encoding.  Whether or not that
>> character is 'z' is a different issue.
>
> Section 2.1 indeed says that %7a and %7A both represent the
> same octet. However, they are not therefore guaranteed to be
> the same character regardless of encoding, because there are
> encodings where a single byte is only representing part of
> a character, and/or can be used in representing different
> characters.

If %7E is used and you replace it with %7e, the character does
not change, regardless of encoding, and therefore the URI spec does
guarantee that the two are equivalent.  There is no need to split hairs
about whether it is one character of only part of one character -- the
result is the same because URI %xx rules govern that part of the encoding.

....Roy

Received on Thursday, 5 December 2002 19:30:01 UTC