Re: -//W3C/DTD XHTML gives no error in Markup validator 0.80 ?

On 8/12/07, Marc Gueury <mgueury@skynet.be> wrote:
>
>  The reason is because OpenSP lack of proxy support. My extension detects
> the following doctype as a XML file and run opensp 1.5.2 with xml.soc on it
> (like the real validator)
>  <!DOCTYPE html PUBLIC "-//W3C/DTD XHTML 1.0 Strict//EN"
> "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
>  The difference, is that my extension can be running behind a proxy. So,
>  -//W3C/DTD XHTML 1.0 Strict//EN
>  does not allow me to find the DTD. And I can not get the
> http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd from
> internet since I can not explain to OpenSP to use the proxy settings of
> Firefox.

Hello Marc,
One way to get around this would be to deliver a catalog of commonly
used DTDs along with your extension. Your entension could then use its
private, local catalog for most validations. That would make it faster
(because it wouldn't have to fetch DTDs from w3.org) and would also
enable it to be used offline. The disadvantage is that it would
increase the installation footprint of your extension. I have a
catalog like this for my OpenSP-based validator and it occupies about
3.2M.

HTH
Philip


>  Olivier Thereaux wrote:
> Hi Marc,
>
>  On Thu, Aug 02, 2007, Marc Gueury wrote:
>
> Hello all,
>
>  I have a user who has noticed this page is
>  http://www.mt-olympus.com/
>  is valid html strict as reported here:
>  http://validator.w3.org/check?verbose=1 ... pus.com%2F
> <http://validator.w3.org/check?verbose=1&uri=http%3A%2F%2Fwww.mt-olympus.com%2F>
>
>  Yes, it is "valid", though valid _what_ is the key here.
>
>
> In the new version of the validator 0.80, there is 0 errors.
>
>  Right. There should be at least a warning that the FPI ("-//W3C/DTD
>  XHTML 1.0 Strict//EN") does not match the SI,
>  (http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd), but
> as far as
>  formal validation is concerned, the document is "valid".
>
>
> In the version 0.70, it reported 41 errors,
>
>  41? At this point in time I see only two errors given by the validator
>  0.7.4 on that page.
> http://qa-dev.w3.org/wmvs/0.7.4/check?uri=http%3A%2F%2Fwww.mt-olympus.com%2F
>
>  the basic reason was this
>
> error in the above file:
>
>  <!DOCTYPE html PUBLIC "-//W3C/DTD XHTML 1.0 Strict//EN" ...
>  should be
>  <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" ...
>  Notice the '/'
>
>  It's a tad more complicated than that.
>
>  * The FPI is bogus
>  * the document is XHTML-ish, but served as text/html (i.e. not served as
> XML)
>
>  The previous version of the validator would see this, wonder "I don't
>  know this document type", and use the "classic" HTML parsing mode as a
>  default. This is what the warning:
>  [[
>  The MIME Media Type (text/html) for this document is used to serve both
>  SGML and XML based documents, and it is not possible to disambiguate it
>  based on the DOCTYPE Declaration in your document. Parsing will continue
>  in SGML mode.
>  ]] is about.
>
>  Because the FPI is unknown, the validator will use the system
>  identifier, download the DTD, and validate.
>
>  The errors came from the fact that the document is XHTML-ish in nature,
>  so some constructs are not OK when parsed as HTML.
>
>
>  In the new validator, there is a new mechanism to detect XML-based
>  documents if an XML declaration is present. Which is the case for your
>  document:
>  <?xml version="1.0" encoding="UTF-8"?>
>  so the validator 0.8.0 triggers the XML mode, and validates (again,
>  since the FPI is bogus... the validator uses the SI) It works, but the i
>  document is not "valid XHTML 1.0 Strict", because it does not properly
>  declares itself as XHTML 1.0.
>
>
> Does one of you know the reason of this change and the logic behind it ?
>
>  The validator now understands it has to use XML parsing when it sees an
>  XML declaration.
>
>  The moral of the story is: never write the DOCTYPE yourself.
>  Tools should do that, or copy it from
>  http://www.w3.org/QA/2002/04/valid-dtd-list.html
>
>
>


-- 
Philip
http://NikitaTheSpider.com/
Whole-site HTML validation, link checking and more

Received on Sunday, 12 August 2007 15:46:02 UTC