- From: Nils Dagsson Moskopp <nils-dagsson-moskopp@dieweltistgarnichtso.net>
- Date: Mon, 12 Jul 2010 15:39:20 +0200
Mike Wilcox <mike at mikewilcox.net> schrieb am Mon, 12 Jul 2010 07:44:07 -0500: > That's a little different. Google purposely uses unstandardized, > incorrect HTML in ways that still render in a browser in order to > make it more difficult for screen scrapers. They also "break it" in a > different way every week. Assuming this is true (which I find difficult to believe), wouldn't a screen scraper based on the HTML5 parsing algorithm defeat this purpose ? Greetings, -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 230 bytes Desc: not available URL: <http://lists.whatwg.org/pipermail/whatwg-whatwg.org/attachments/20100712/21a6c0d0/attachment.pgp>
Received on Monday, 12 July 2010 06:39:20 UTC