W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > September 2011

[Bug 14284] Need HTML parser algorithm options

From: <bugzilla@jessica.w3.org>
Date: Mon, 26 Sep 2011 05:22:37 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1R83eH-0004Lt-OZ@jessica.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=14284

--- Comment #1 from Henri Sivonen <hsivonen@iki.fi> 2011-09-26 05:22:36 UTC ---
XHR2 could do this without HTML providing flags by saying:
If final MIME type is text/html let document be Document object that represents
the response entity body parsed following the rules set forth in the HTML
specification for an HTML parser with scripting disabled, ignoring internal
encoding declarations discovered by the tree builder (i.e. only honoring the
<meta> prescan for the first 1024 bytes for internal encoding declarations),
without applying frequency analysis and using UTF-8 is the _default encoding_
in the last step of the algorithm for determining the character encoding.

-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Monday, 26 September 2011 05:22:39 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 20:02:04 UTC