W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2001

Re: Tidy uses undf. entities with -xml

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sat, 14 Apr 2001 00:59:08 +0200
To: "Randy Waki" <rwaki@flipdog.com>
Cc: html-tidy@w3.org
Message-ID: <941fdt0q090tuohgrofkremrsdbj6us5sj@4ax.com>
* Randy Waki wrote:
>> If I've got a XML document that includes e.g. an &nbsp; UTF-8 encoded,
>> and i tidy it up with -xml I get the HTML entity &nbsp; instead of the
>> korrekt UTF-8 sequence. The XML document isn't wellformed any longer and
>> therefore unusable. Using the -utf8 command line argument doesn't change
>> this behaivour.
>
>Try -asxml.  The -xml option actually tells Tidy that the *input* is XML
>while -asxml tells Tidy to *output* XML.

Doesn't help; btw. if the input is XML, the output must be XML, too.
-- 
Björn Höhrmann { mailto:bjoern@hoehrmann.de } http://www.bjoernsworld.de
am Badedeich 7 } Telefon: +49(0)4667/981028 { http://bjoern.hoehrmann.de
25899 Dagebüll { PGP Pub. KeyID: 0xA4357E78 } http://www.learn.to/quote/
Received on Friday, 13 April 2001 18:58:07 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:45 GMT