In the "algorithm for extracting an encoding from a Content-Type": "Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters that immediately follow the word equals sign (there might not be any)." - s/word// "If it is a U+0022 QUOTATION MARK ('"') and there is a later U+0022 QUOTATION MARK ('"') in s: Return string between the two quotation marks." - unclear since there can be multiple later quotation marks (e.g. <meta content=';charset="utf-8"oops"'>), so it should be explicit that it means the earliest one. Same about apostrophes. -- Philip Taylor pjt47@cam.ac.ukReceived on Wednesday, 5 March 2008 16:33:30 GMT
This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 9 May 2012 00:16:13 GMT