- From: Anne van Kesteren <annevk@opera.com>
- Date: Thu, 21 Dec 2006 18:15:05 +0100
On Thu, 21 Dec 2006 18:09:43 +0100, Thomas Broyer <t.broyer at gmail.com> wrote:
> But it also has this note, which is quite confusing: "Because
> lowercase letters in the name are uppercased by the algorithm above,
> the "HTML" letters are actually case-insensitive relative to the
> markup."

During tokenization you store the lowercase ASCII characters as uppercase. So you can do a case-sensitive comparison with "HTML" in the end ("HTML" will also end up in the DOM or whatever model you use there). In the markup it could be written as <!doctype html> which is what is suggested there.

> However, section 8.1.1 says:
> http://www.whatwg.org/specs/web-apps/current-work/#doctype
> """
> In other words, <!DOCTYPE HTML>, case-insensitively.
> """
>
> So I guess you're right.

Learned this when writing the implementation of it :-)

> It remains that the tokenization stage is a bit confusing?

-- 
Anne van Kesteren
<http://annevankesteren.nl/>
<http://www.opera.com/>
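A minimal sketch of the idea described above, in Python. It is not the spec's tokenizer: the regular expression and the names `tokenize_doctype_name` and `is_html_doctype` are hypothetical, chosen only to show how uppercasing the ASCII letters of the name during tokenization lets the later check be a plain case-sensitive comparison with "HTML".

```python
import re

# Hypothetical, simplified matcher for a bare doctype; a real tokenizer is
# a character-by-character state machine, which this does not reproduce.
_DOCTYPE_RE = re.compile(r"<!doctype\s+([^\s>]+)\s*>", re.IGNORECASE)


def tokenize_doctype_name(markup):
    """Return the doctype name with ASCII a-z uppercased, or None."""
    match = _DOCTYPE_RE.match(markup)
    if match is None:
        return None
    name = match.group(1)
    # Uppercase only ASCII lowercase letters; leave every other character alone.
    return "".join(
        chr(ord(ch) - 0x20) if "a" <= ch <= "z" else ch
        for ch in name
    )


def is_html_doctype(markup):
    """Case-sensitive comparison against "HTML" after tokenization."""
    return tokenize_doctype_name(markup) == "HTML"
```

With this sketch, `<!doctype html>`, `<!DOCTYPE HTML>` and `<!DocType hTmL>` all yield the stored name "HTML", so the markup is effectively case-insensitive even though the final comparison is case-sensitive.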
Received on Thursday, 21 December 2006 09:15:05 UTC