W3C home > Mailing lists > Public > html-tidy@w3.org > July to September 2000

Re: Problem with HTML Tidy: no encoding specified in XML output

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Mon, 4 Sep 2000 12:30:11 +0200
Message-ID: <001d01c0165b$1c3b7eb0$cccbb43e@de>
To: Mikael Ståldal <d96-mst-ingen-reklam@d.kth.se>
Cc: <html-tidy@w3.org>
* "Mikael Ståldal" <d96-mst-ingen-reklam@d.kth.se> wrote:
| When using HTML Tidy with the options -asxml -latin1, it doesn't output
|
| <?xml version="1.0" encoding="iso-8859-1"?>
|
| as it should in order to produce well-formed XML. Without the encoding
| specification, an XML parser will assume UTF-8.

Use '--add-xml-decl yes' but i agree, that tidy should do this automatically
(if there are iso-8859-1 characters in the file. If all chars are encoded as
entities it isn't necessary, beacause the file is us-ascii and us-ascii is a
subset of utf-8, the default encoding of XML files.)

regards,
--
Björn Höhrmann ^ mailto:bjoern@hoehrmann.de ^ http://www.bjoernsworld.de
am Badedeich 7 ° Telefon: +49(0)4667/981ASK ° http://bjoern.hoehrmann.de
25899 Dagebüll # PGP Pub. KeyID: 0xA4357E78 # http://learn.to/quote +{i}
Received on Monday, 4 September 2000 06:31:31 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:44 GMT