Re: Canonicalization xml:base processing

Konrad Lanz scripsit:

> Up front I'd like to mention that after talking to Jose Kahan and 
> thinking about the issue for a little longer we'd still prefer to also 
> perform "dot and dot-dot canonicalization" (aka. remove_dot_segments). 
> It will allow the reuse of existing implementations for relative URI 
> resolution. More important from my point of view however is: "dot and 
> dot-dot canonicalization" allows to map more equivalent documents onto 
> the same serialized output and helps to avoid false negatives in XMLDSig.

Unfortunately doing so produces inaccurate results.  For example, suppose
the outermost xml:base value is "foo/bar/" and the next inner one is
"../../..".  Ordinary remove_dot_segments will reduce this to the null
string, but this is incorrect.

Leaving it as "foo/bar/../../.." produces the same result as in the
original context when applied to an equivalent base URI.  It is only
safe to remove dots when that base URI is fully available.

-- 
The man that wanders far                        cowan@ccil.org
from the walking tree                           http://www.ccil.org/~cowan
        --first line of a non-existent poem by:         John Cowan

Received on Thursday, 18 May 2006 15:45:18 UTC