W3C home > Mailing lists > Public > www-html@w3.org > November 1998

Re: tidy -asxml fix

From: Dave Raggett <dsr@w3.org>
Date: Mon, 9 Nov 1998 16:15:50 +0000 ()
To: Dan Connolly <connolly@w3.org>
cc: www-html@w3.org
Message-ID: <Pine.WNT.3.95.981109161132.-29207R-100000@hazel.hpl.hp.com>
On Thu, 5 Nov 1998, Dan Connolly wrote:

> Dan Connolly wrote:
> > 
> > The tidy[1] -asxml feature is a pretty cool idea,
> > but it's broken in the 1Sep release[2].
> > 
> > [1] http://www.w3.org/People/Raggett/tidy/
> > [2] http://www.w3.org/People/Raggett/tidy01sep98.tgz
> [...]
> > Also, the XML declaration should be
> >         -- nothing if the encoding is UTF-8 (or US-ASCII) or UTF-16
> >         -- <?xml encoding="iso-8859-1" version="1.0">
> >                 if the tidy output is -latin1
> >                 and similar for -iso2022, but I don't know the
> >                 details.
> > 
> > So FixDocType should take another argument for the encoding.
> > I haven't hacked that up yet, but it should be easy.
> 
> OK... done. patch attached. (turns out the encoding is
> a global variable, so I dind't have to add an argument.)

Great - thanks for the patch. I plan to beef up the -asxml mode
as W3C progresses the details of how to XMLize HTML 4.0 documents.
The HTML working group is developing guidelines for this covering
both html aware clients and generic xml clients.

> The patch also adds -ansi to the gcc invocation;
> I got some warnings about redeclaration of uint in platform.h
> on linux, and this fixed it. But it made the // style comments
> generate errors, so I supplimented them with #if 0/#endif.

Thanks. I am working on a major new release of tidy and hope
to issue it in early December.

Regards,

-- Dave Raggett <dsr@w3.org> http://www.w3.org/People/Raggett
tel/fax: +44 122 578 2984 (or 2521) +44 385 320 444 (gsm mobile)
World Wide Web Consortium (on assignment from HP Labs)
Received on Monday, 9 November 1998 11:13:02 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 March 2012 18:15:37 GMT