- From: Dave Raggett <dsr@w3.org>
- Date: Fri, 24 Mar 2000 11:47:14 -0600
- To: "J. David Bryan" <jdbryan@acm.org>
- Cc: HTML Tidy List <html-tidy@w3.org>
On Wed, 23 Feb 2000, J. David Bryan wrote: > Whenever Tidy supplies a corrected DOCTYPE, it produces one such as > "!DOCTYPE html PUBLIC...", i.e., with the "html" in lower case. In > lexer.c, the "FindGivenVersion" routine, which is responsible for parsing > the DOCTYPE statement, has this comment at line 769: > > /* but at least ensure the case is correct */ > > FindGivenVersion then replaces the DOCTYPE string supplied in the source > HTML file with the identical string but containing "html" in lower case. > > Can someone please explain why changing this to lower case is > "correct?" Thanks. In SGML the case of a tag doesn't matter. In XML it does. For XHTML, which is a reformulation of HTML in XML, the W3C HTML working group was forced into making a choice of case for HTML elements and decided to use lower case. The root element "html" therefore needs to be in lower case for XHTML but could be in either case when the document conforms to SGML. Regards, -- Dave Raggett <dsr@w3.org> http://www.w3.org/People/Raggett tel/fax: +44 122 578 3011 (or 2521) +44 385 320 444 (mobile) World Wide Web Consortium (on assignment from HP Labs)
Received on Friday, 24 March 2000 13:13:06 UTC