W3C home > Mailing lists > Public > www-validator@w3.org > August 2004

SGML Id Tool (was: suggest validator prefer URI to FPI)

From: Sean B. Palmer <sean+wv@infomesh.net>
Date: Thu, 05 Aug 2004 23:05:02 +0100
Message-ID: <4112AF0E.5080401@infomesh.net>
To: www-validator@w3.org
Cc: Terje Bless <link@pobox.com>, Dan Connolly <connolly@w3.org>

> Patches gratefully accepted!

Though SP doesn't include external identifiers in its ESIS output, it
does provide access to them through its generic API, making it possible
to build a tool on top of it in C++ that outputs just the PubID and
SysID (if present) on two consecutive lines. And I've done just that.

All of the instructions and source are available at:

    http://inamidst.com/proj/sgmlid/

I suggest you grab sgmlid.tar.gz, unpack it, and then follow the
instructions in the README.txt file.

Here's an example of the kind of thing that it does:

[[[
$ ./sgmlid test.html 2> /dev/null
PUBLIC: -//W3C//DTD XHTML 1.0 Strict//EN
SYSTEM: http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd
]]]

Note that SP may raise various errors along the way that you have to
redirect off to /dev/null as shown.

I can release sgmlid under whichever open source license you require
should you want to incorporate it into the validator codebase. Even if,
as is most likely, you do not want to use the code, it at least
demonstrates that this problem is an extremely easy one to fix with
relatively little application.

Whatever the tool used, I suggest you simply peek at the PubId and
SysId, and if they don't match raise some kind of achingly obvious
"Fatal Warning" to users: I don't believe that defaulting to either ID
is acceptable given that inconsistency will never be intentional on the
behalf of the user or their authoring tool.

Thanks,

-- 
Sean B. Palmer, http://inamidst.com/
Received on Thursday, 5 August 2004 18:05:34 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 5 February 2014 07:14:08 UTC