- From: olivier Thereaux <ot@w3.org>
- Date: Wed, 13 Jul 2005 14:05:06 +0900
- To: Philip TAYLOR <P.Taylor@Rhul.Ac.Uk>
- Cc: www-validator@w3.org
Hi Philip,
Thanks for checking the beta validator.
On Jul 12, 2005, at 22:06, Philip TAYLOR wrote:
> http://validator.w3.org:8001/check?uri=http://www.rhul.ac.uk/ ->
>
> Unknown Document Type and Parse Mode!
I checked the part of the code that issued this warning. The said
warning only happens when:
- the pre-parsing found a Doctype
- and the content-type cannot disambiguate whether to use XGML or XML
mode (i.e, text/html)
- ... but the doctype is not in our types database with info to
disambiguate the mode
so instead of
[[
The MIME Media Type (text/html) for this document is used to serve both
SGML and XML based documents, and no DOCTYPE Declaration was found to
disambiguate it. Parsing will continue in SGML mode and with a fallback
DOCTYPE similar to HTML 4.01 Transitional.
]]
I think it should be something like
[[
The MIME Media Type (text/html) for this document is used to serve both
SGML and XML based documents, and it is not possible to disambiguate it
based on the DOCTYPE Declaration in your document. Parsing will
continue in SGML mode.
]]
I think Terje initially wrote this, he's really busy these days but
I'll try to see if he can give it a look.
Now for the other issue...
> I should add that it commences :
> 1: <!DOCTYPE HTML PUBLIC "-//RHBNC//DTD HTML 4.01 Augmented//EN"
> 2: "http://www.rhul.ac.uk/shared/dtds/HTML-4.01-Augmented.dtd"
> 3: >
> Error Line 76 column 27: general entity "nbsp" not defined and no
> default entity.
> This diagnostic is not issued by the current validator
This is SGML territory, so hopefully someone will be able to confirm,
or correct, my understanding of the situation.
* You are using a "custom" DTD, based on a copy of the HTML 4.01 DTD,
and which you're publishing at:
http://www.rhul.ac.uk/shared/dtds/HTML-4.01-Augmented.dtd
* In that DTD, the reference to entities is made (as in HTML 4.01) with
relative URIS, e.g:
<!ENTITY % HTMLlat1 PUBLIC
"-//W3C//ENTITIES Latin1//EN//HTML"
"HTMLlat1.ent">
%HTMLlat1;
But there is nothing at http://www.rhul.ac.uk/shared/dtds/HTMLlat1.ent
Isn't that a mistake?
Now the reason why the "usual" validator (v0.6.7) does not complain
about this is that the SGML catalogue it uses knows how to dereference
the "-//W3C//ENTITIES Latin1//EN//HTML" FPI, whereas the "new"
validator has a catalogue that only knows "-//W3C//ENTITIES Latin
1//EN//HTML". This is most likely a victim of a cleanup of the said
catalogue. The cleanup was a bit zealous and it's possible that this
removal was a mistake. Hmm, quite probable actually, the DTD in the
HTML4.01 spec uses the "Latin1" FPI, not "Latin 1". Could anyone among
our SGML gurus confirm?
Thanks,
--
olivier
Received on Wednesday, 13 July 2005 05:05:17 UTC