[whatwg] Bug in "Before DOCTYPE name state"? from Anne van Kesteren on 2006-12-21 (public-whatwg-archive@w3.org from December 2006)

From: Anne van Kesteren <annevk@opera.com>
Date: Thu, 21 Dec 2006 18:15:05 +0100
Message-ID: <op.tkw57f1o64w2qv@id-c0020.orange.nl>

On Thu, 21 Dec 2006 18:09:43 +0100, Thomas Broyer <t.broyer at gmail.com>  
wrote:
> But it also has this note, which is quite confusing: "Because
> lowercase letters in the name are uppercased by the algorithm above,
> the "HTML" letters are actually case-insensitive relative to the
> markup."

During tokenization you store the lowercase ASCII characters as uppercase.  
So you can do a case-sensitive comparison with "HTML" in the end ("HTML"  
will also end up in the DOM or whatever model you use there).

In the markup it could be written as <!doctype html> which is what is  
suggested there.

> However, section 8.1.1 says:
> http://www.whatwg.org/specs/web-apps/current-work/#doctype
> """
> In other words, <!DOCTYPE HTML>, case-insensitively.
> """
>
> So I guess you're right.

Learned this when writing the implementation of it :-)

> It remains that the tokenization stage is a bit confusing?

-- 
Anne van Kesteren
<http://annevankesteren.nl/>
<http://www.opera.com/>

Received on Thursday, 21 December 2006 09:15:05 UTC