Re: C14N-Hash implementations???

> I am very curious whether anyone has done what I call C14N-Hash.
> That is, all C14N implementations I have heard of run exorbitantly
> long times.  I suspect that that runtime is due mostly to string
> concatenation operations.  If instead of building a single canonical
> XML string you walk a DOM and only send substrings to a hash
> accumulator, in the C14N order, you should be able to produce the
> C14N hash of a DOM structure in almost the time it takes to walk that
> structure for printing without canonicalization.
>
> So, has anyone done that experiment?  If so, how did it perform?

On c14n runtime: there are two basically different forms of c14n: (1) canonicalizing a full subtree, which is moderately fast, and (2) canonicalizing a node set (document subset), which takes much longer. What really costs time is keeping track of the [in-scope namespaces] and deciding, for each one, whether you have to output it or not.

My estimate is that most of the time is spent in the DOM tree traversal (including namespace administration), not in the string concatenations (which involve copy operations etc.).
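One way to test this estimate would be to run the same traversal twice, once feeding fragments straight into a hash accumulator and once building the full string first. The following is a rough sketch in Python, assuming xml.dom.minidom and SHA-1. The serializer (chunks) is a hypothetical, heavily simplified stand-in and NOT real C14N: it handles only elements, attributes sorted by name, and text. Both variants share the tree walk, so any difference in runtime would be attributable to the concatenation alone.

```python
import hashlib
from xml.dom import minidom
from xml.sax.saxutils import escape, quoteattr


def chunks(node):
    """Yield serialized fragments in document order (simplified, not real C14N)."""
    if node.nodeType == node.TEXT_NODE:
        yield escape(node.data)
    elif node.nodeType == node.ELEMENT_NODE:
        yield "<" + node.tagName
        # Attributes sorted by name only; real C14N sorts by namespace URI too.
        for name, value in sorted(node.attributes.items()):
            yield " " + name + "=" + quoteattr(value)
        yield ">"
        for child in node.childNodes:
            yield from chunks(child)
        yield "</" + node.tagName + ">"


def hash_streaming(root):
    """Feed each fragment to the hash as it is produced; no big string is built."""
    h = hashlib.sha1()
    for piece in chunks(root):
        h.update(piece.encode("utf-8"))
    return h.hexdigest()


def hash_concatenated(root):
    """Build the whole serialized string first, then hash it in one go."""
    text = "".join(chunks(root))
    return hashlib.sha1(text.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    doc = minidom.parseString('<a z="1" y="2"><b>t &amp; u</b></a>')
    root = doc.documentElement
    print(hash_streaming(root) == hash_concatenated(root))
```

Both functions produce the same digest, so timing them against each other on a large document (for example with timeit) would show how much of the total the concatenation really accounts for.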

Christian

Received on Friday, 26 July 2002 10:59:24 UTC