W3C home > Mailing lists > Public > public-html@w3.org > March 2008

Editorial: "algorithm for extracting an encoding from a Content-Type"

From: Philip Taylor <pjt47@cam.ac.uk>
Date: Wed, 05 Mar 2008 16:33:03 +0000
Message-ID: <47CECB3F.6@cam.ac.uk>
To: HTML WG <public-html@w3.org>

In the "algorithm for extracting an encoding from a Content-Type":

"Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters 
that immediately follow the word equals sign (there might not be any)." 
- s/word//

"If it is a U+0022 QUOTATION MARK ('"') and there is a later U+0022 
QUOTATION MARK ('"') in s: Return string between the two quotation 
marks." - unclear since there can be multiple later quotation marks 
(e.g. <meta content=';charset="utf-8"oops"'>), so it should be explicit 
that it means the earliest one. Same about apostrophes.

-- 
Philip Taylor
pjt47@cam.ac.uk
Received on Wednesday, 5 March 2008 16:33:30 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 9 May 2012 00:16:13 GMT