- From: John Dunlop <jdunlop@cis.strath.ac.uk>
- Date: Sat, 17 May 2003 17:55:27 +0100
- To: www-validator@w3.org
Pottering about with the validator today, up cropped a wee inconsistency. Perhaps unimportant in the grand scheme of things, and label me as a nitpicker of you like, but I believe it's worthy of a mention nonetheless. To set the scene: if I have an entity reference with the name "foo:bar" ("&foo:bar;" for instance), the validator will correctly spot and report this as an error, since that entity isn't defined in HTML 4.01. That's fine and dandy; everything's running marvelously. My somewhat pedantic dispute is with the error message displayed. Although it doesn't belie the reason for the error, it seems that there is a glitch. For an entity reference with the name "foo:bar", the error message is: [...] cannot generate system identifier for general entity "foo Notice that the entity name is cut short, the colon and the following name characters are sliced off. The validator seems to be choking on the colon, which is a valid name character, as we can see by looking at the SGML declaration of HTML 4.01: NAMING LCNMSTRT "" UCNMSTRT "" LCNMCHAR ".-_:" UCNMCHAR ".-_:" The other additions to the set of possible name characters in HTML don't have the same symptom. Surely the message should indicate the entity name as "foo:bar"? Or are there some mystifying workings going on that have passed me by? Neither the WDG's validator nor Nick Kew's Page Valet show this symptom. Feed this to the validator (I'll leave it up for some time): http://www.cis.strath.ac.uk/~jdunlop/entity-name.html I'll subscribe to the list for a while too. Cheers, -- Jock
Received on Saturday, 17 May 2003 13:00:58 UTC