W3C home > Mailing lists > Public > whatwg@whatwg.org > July 2010

[whatwg] Article: Growing pains afflict HTML5 standardization

From: Nils Dagsson Moskopp <nils-dagsson-moskopp@dieweltistgarnichtso.net>
Date: Mon, 12 Jul 2010 15:39:20 +0200
Message-ID: <20100712153920.2ffd3947@desudesudesu>
Mike Wilcox <mike at mikewilcox.net> schrieb am Mon, 12 Jul 2010 07:44:07
-0500:

> That's a little different. Google purposely uses unstandardized,
> incorrect HTML in ways that still render in a browser in order to
> make it more difficult for screen scrapers. They also "break it" in a
> different way every week.

Assuming this is true (which I find difficult to believe), wouldn't a
screen scraper based on the HTML5 parsing algorithm defeat this
purpose ?

Greetings,
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 230 bytes
Desc: not available
URL: <http://lists.whatwg.org/pipermail/whatwg-whatwg.org/attachments/20100712/21a6c0d0/attachment.pgp>
Received on Monday, 12 July 2010 06:39:20 UTC

This archive was generated by hypermail 2.4.0 : Wednesday, 22 January 2020 16:59:24 UTC