Re: cryptic error in nsgmls validation from emacs

From: Terje Bless <link@pobox.com>
Date: Wed, 29 May 2002 21:52:40 +0200
To: "Michael D. Crawford" <crawford@goingware.com>
cc: www-validator@w3.org
Message-ID: <r01050200-1014-493DDF5A733E11D6ABA800039300CF5C@[192.168.1.7]>

Michael D. Crawford <crawford@goingware.com> wrote:

>nsgmls:/usr/share/sgml/declaration/html.decl:23:32:W: characters in the
>document character set with numbers exceeding 65535 not supported

The nsgmls on your box only handles character numbers up to 65535 (16 bits),
but the SGML Declaration specified lists the first part of Unicode, which runs
past that limit, as the Document Character Set. You need to either futz the
SGML Declaration (a bad idea, but it might be a workable band-aid) or get a
better SGML parser.
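
If you want to see which entries in the declaration trigger the warning, you
could scan its DESCSET sections for ranges that run past 65535. This is only a
minimal sketch, assuming each entry sits on one line as a start number, a
count, and a base number or UNUSED. The function name and the sample entries
are made up for illustration and are not copied from your html.decl.

```python
import re

# Assumed layout of a DESCSET entry: "<start> <count> <base-or-UNUSED>",
# optionally preceded by the DESCSET keyword on the first entry.
ENTRY = re.compile(r"^\s*(?:DESCSET\s+)?(\d+)\s+(\d+)\s+(\S+)")

LIMIT = 65535  # highest character number named in the nsgmls warning


def ranges_over_limit(decl_text, limit=LIMIT):
    """Return (line_no, start, count) for entries whose last number exceeds limit."""
    hits = []
    for line_no, line in enumerate(decl_text.splitlines(), start=1):
        m = ENTRY.match(line)
        if not m:
            continue  # not a DESCSET-style entry
        start, count = int(m.group(1)), int(m.group(2))
        if count > 0 and start + count - 1 > limit:
            hits.append((line_no, start, count))
    return hits


# Illustrative entries only (hypothetical, not taken from a real declaration).
SAMPLE = """\
DESCSET 0     9      UNUSED
        32    95     32
        160   65376  160
        65536 1000   65536
"""

if __name__ == "__main__":
    for line_no, start, count in ranges_over_limit(SAMPLE):
        print(f"line {line_no}: {start} + {count} runs past {LIMIT}")
```

This only reports the offending lines. It does not edit the declaration, which
is the part that counts as futzing.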

OpenSP <http://openjade.sf.net/> builds with semi-decent Unicode support by
default in recent versions. There is a "Mumble"BSD port of it, and it
includes an RPM spec file so you can build an RPM, but it is slightly painful
to build owing to the wonderfully incompatible versions of autoconf that
are in circulation. If you run into trouble, you might try posting to the
openjade-devel list (info on the SF project page).

FWIW, OpenSP is the SGML parser we will use for the Validator sometime in the
not too distant future. Currently we use a hacked-up version (endless
thank-yous to Liam Quinn for the patches!) of the original SP and simply ignore
the error message you refer to above.
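
If you stay with your current nsgmls for now, you could do something similar
on your side and filter that one warning out of the parser output before
reading it. The sketch below is not the Validator's code. It assumes one
message per line in the format you quoted, and the function name and the
second sample message are hypothetical.

```python
import re

# Matches the warning quoted above: ":<line>:<col>:W: ... exceeding 65535 ..."
WARNING = re.compile(r":\d+:\d+:W: .*numbers exceeding 65535 not supported")


def drop_charset_warning(lines):
    """Return the parser messages without the 65535 character-set warning."""
    return [line for line in lines if not WARNING.search(line)]


# First entry is the quoted warning joined onto one line;
# the second is an illustrative error that should pass through untouched.
SAMPLE = [
    "nsgmls:/usr/share/sgml/declaration/html.decl:23:32:W: characters in the "
    "document character set with numbers exceeding 65535 not supported",
    'nsgmls:page.html:10:4:E: element "FOO" undefined',
]

if __name__ == "__main__":
    for line in drop_charset_warning(SAMPLE):
        print(line)
```

Everything else nsgmls reports still comes through, so real markup errors are
not hidden.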

-- 
Ladies and gentlemen, you must resist those all-too-human feelings and decide
this case on the evidence.    And the evidence plainly shows that Mr. Landa's
injuries,   disfiguring as they are,  are nowhere near as important to a free
society as the fundamental right to make smart-ass remarks.   -- Katie @ AtAT
Received on Wednesday, 29 May 2002 15:59:42 GMT
