- From: <bugzilla@wiggum.w3.org>
- Date: Tue, 11 Sep 2007 06:35:56 +0000
- To: www-validator-cvs@w3.org
- CC:
http://www.w3.org/Bugs/Public/show_bug.cgi?id=5031
Summary: Doctype detection fails if root element includes non-"word" character
Product: Validator
Version: 0.8.1
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P3
Component: check
AssignedTo: ot@w3.org
ReportedBy: ot@w3.org
QAContact: www-validator-cvs@w3.org
The doctype detection routine in preparse_doctype() uses the following regexp to
detect the FPI (formal public identifier) and SI (system identifier):
m(<!DOCTYPE\s+(\w+)\s+(?:PUBLIC|SYSTEM)\s+...
The first group, (\w+), captures the name of the document type, which has to
match the root element
(ref: http://www.w3.org/TR/xml/#vc-roottype )
But \w+ is too restrictive here, as the root element name can (among other
characters) contain a dash or a dot.
(ref: http://www.w3.org/TR/xml/#IDANQDS )
This partially breaks doctype detection for languages whose root element name
includes characters outside the Perl "word" class (alphanumerics plus _).
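The failure can be reproduced outside the validator with a small sketch. It is
written in Python rather than the validator's Perl, purely as an illustration;
\w and the character class behave the same way for these inputs. OLD is the
reported pattern, cut off where the report elides it. NEW, root_name() and the
"my-doc.v2" doctype are hypothetical, and NEW is only an ASCII approximation of
the XML Name production, not a proposed patch.

```python
import re

# Pattern from the report, truncated at the point where the report elides it.
OLD = re.compile(r'<!DOCTYPE\s+(\w+)\s+(?:PUBLIC|SYSTEM)\s+')

# Hypothetical alternative: an ASCII-only approximation of an XML Name
# (letter, underscore or colon first; then word characters, colon, dot, dash).
NEW = re.compile(r'<!DOCTYPE\s+([A-Za-z_:][\w:.-]*)\s+(?:PUBLIC|SYSTEM)\s+')


def root_name(pattern, text):
    """Return the captured document type name, or None if nothing matches."""
    match = pattern.search(text)
    return match.group(1) if match else None


# A common doctype whose root element name is a plain "word".
plain = ('<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" '
         '"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">')

# Made-up doctype whose root element name contains a dash and a dot.
dashed = '<!DOCTYPE my-doc.v2 SYSTEM "my-doc.dtd">'

print(root_name(OLD, plain))   # html
print(root_name(OLD, dashed))  # None
print(root_name(NEW, plain))   # html
print(root_name(NEW, dashed))  # my-doc.v2
```

With the old pattern, \w+ stops at the dash, the following \s+ cannot match,
and the whole DOCTYPE goes undetected. A complete fix would need to follow the
Name production in the XML specification, which also allows non-ASCII
characters.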
Received on Tuesday, 11 September 2007 06:36:01 UTC