- From: David P. Heitmeyer <heitmey@fas.harvard.edu>
- Date: Fri, 7 Jan 2000 11:17:07 -0500 (EST)
- To: Brad Stowers <bstowers@pobox.com>
- cc: html-tidy@w3.org
I've experienced the problem with the DOCTYPE in XML as well. If it
helps, I observe the problem with the Nov 30 1999 version of tidy, but the
July 26 1999 version does not have this problem.
Example:
fas% lwp-request http://www.news.perl.org/perl-news-short.rdf
<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN"
"http://my.netscape.com/publish/formats/rss-0.91.dtd">
<rss version="0.91">
<channel>
<title>Perl News</title>
<link>http://www.news.perl.org/</link>
[...cut...]
Example of Tidy processing:
fas% lwp-requesthttp://www.news.perl.org/perl-news-short.rdf | tidy -xml
<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN"
"http://my.netscape.com/publish/formats/rss-0.91.dtd">
[eof]
-- David
David P. Heitmeyer
Web Systems Administrator
FAS Computer Services
email: heitmey@fas.harvard.edu
Phone: 617-495-1259
Fax: 617-495-1210
On Wed, 5 Jan 2000, Brad Stowers wrote:
> I'm trying to run an XML file through tidy and it seems to choke on the doctype (the output stops at that line). If I remove the line, the file is properly formatted. Unfortunately, I need the line in there. I could remove it, run it through tidy, and then put it back in, but I'd really prefer not to do that if I don't have to. I'm hoping I'm missing some config option or something, but I couldn't find it if I am. Anyway, here's what the beginning of the source file looks like:
>
> <?xml version="1.0" ?>
> <?fdl version="Beta 1"?>
> <!DOCTYPE fenestra-fdl SYSTEM "fenestra-fdl.dtd">
>
> <fenestra-fdl>
>
> <constraint-package>
>
> [...snip...]
>
>
> and after formatting:
>
> <?xml version="1.0"?><?fdl version="Beta 1"?>
> <!DOCTYPE fenestra-fdl SYSTEM "fenestra-fdl.dtd">
>
> [end of file]
>
>
> As I said, if I remove the !DOCTYPE tag, it will work fine. Any ideas?
>
> --Brad
>
Received on Friday, 7 January 2000 11:17:09 UTC