Re: Equivalence measures

"Nick Kew":

> I've hacked up a demo-of-concept for computing markup equivalence.
> I think this shows fairly clearly "can be done" (modulo likely bugs:-)
>
> I have some interesting thoughts arising from this - more anon.
>
> http://valet.webthing.com/misc/dochash.html

Seems to work well, looking at various sites (from
http://jibbering.com/faq/  - those with a dashed blue CSS border are
likely canditates for changing the content but not the real content of
the page, not all of them some have genuinely been updated.)

Hashes for http://www.4guysfromrolla.com/

All Content:  +uc9IsEiBp+tN515ZrzNPA
All Content:  ZbwUTd1AefPXRFXuoRbSgA

Elements+Attributes: 5mkWkrX4jHkW6puPxemLXg
Elements+Attributes: M12A0q8ru2Hwa68MokMu6Q

Elements:  H3SxvXfXr4/A/sEMEJfv1A
Elements:  H3SxvXfXr4/A/sEMEJfv1A

Headings:  1B2M2Y8AsgTpgAmY7PhCfg
Headings:  1B2M2Y8AsgTpgAmY7PhCfg

Hashes for http://members.tripod.com/~housten/download/

All Content:  Hrba1cfPUSe7HXJ7/ladqg
All Content:  /CZ+eGBCvYvCQ15aOHAGxA

Elements+Attributes: rUQEgeDpnItb0yiTcle6Qg
Elements+Attributes: OM3rRcYmAe7EgQx6/geLpA

Elements:  ux6FOM1ocfSGq1uZlK94OQ
Elements:  ux6FOM1ocfSGq1uZlK94OQ

Headings:  3ILvnfWNiyZV5IPzkYTALw
Headings:  3ILvnfWNiyZV5IPzkYTALw

I assume you'll be sharing the exact method of creating the hashes?

Cheers,

Jim.

Received on Tuesday, 11 December 2001 06:51:16 UTC