W3C home > Mailing lists > Public > html-tidy@w3.org > January to March 2001

RE: Tidy uses undf. entities with -xml

From: Randy Waki <rwaki@flipdog.com>
Date: Sun, 14 Jan 2001 16:05:49 -0700
To: "Bjoern Hoehrmann" <derhoermi@gmx.net>, <html-tidy@w3.org>
Message-ID: <000401c07e7e$88e015e0$b665a8c0@rwaki>
Bjoern Hoehrmann wrote:
> 
> If I've got a XML document that includes e.g. an &nbsp; UTF-8 encoded,
> and i tidy it up with -xml I get the HTML entity &nbsp; instead of the
> korrekt UTF-8 sequence. The XML document isn't wellformed any longer and
> therefore unusable. Using the -utf8 command line argument doesn't change
> this behaivour.

Try -asxml.  The -xml option actually tells Tidy that the *input* is XML
while -asxml tells Tidy to *output* XML.

- Randy
Received on Sunday, 14 January 2001 18:03:56 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:45 GMT