RE: Tidy uses undf. entities with -xml

From: Randy Waki <rwaki@flipdog.com>
Date: Fri, 13 Apr 2001 21:51:16 -0600
To: "Bjoern Hoehrmann" <derhoermi@gmx.net>
Cc: <html-tidy@w3.org>

Bjoern Hoehrmann wrote:
> * Randy Waki wrote:
> >> If I've got an XML document that includes e.g. an &nbsp; UTF-8 encoded,
> >> and I tidy it up with -xml I get the HTML entity &nbsp; instead of the
> >> correct UTF-8 sequence. The XML document isn't well-formed any longer and
> >> therefore unusable. Using the -utf8 command line argument doesn't change
> >> this behaviour.
> >
> >Try -asxml.  The -xml option actually tells Tidy that the *input* is XML
> >while -asxml tells Tidy to *output* XML.
> Doesn't help; btw. if the input is XML, the output must be XML, too.

Right you are.  Sorry, I wasn't paying close enough attention to my options.
Try -numeric (it tells Tidy to output numeric character references).
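
To make the difference concrete: a named entity such as &nbsp; is only defined
by the HTML DTD, while the numeric character reference &#160; is valid in any
XML document. The sketch below is not Tidy's implementation; it is a small
Python illustration of the substitution, and the function name
to_numeric_refs is hypothetical. It assumes the HTML 4 entity table from
Python's standard library and leaves XML's predefined entities untouched.

```python
import re
from html.entities import name2codepoint

# Entities that XML itself predefines; these are safe to leave as names.
XML_PREDEFINED = {"amp", "lt", "gt", "quot", "apos"}


def to_numeric_refs(text: str) -> str:
    """Replace HTML named entities with numeric character references."""

    def replace(match: re.Match) -> str:
        name = match.group(1)
        # Leave XML's own entities and unknown names exactly as they are.
        if name in XML_PREDEFINED or name not in name2codepoint:
            return match.group(0)
        return f"&#{name2codepoint[name]};"

    return re.sub(r"&([A-Za-z][A-Za-z0-9]*);", replace, text)


if __name__ == "__main__":
    print(to_numeric_refs("<p>a&nbsp;b &amp; c</p>"))
```

With this sketch, &nbsp; becomes &#160; while &amp; is left alone, which is
the kind of output an XML parser without the HTML DTD can still read.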

- Randy
Received on Friday, 13 April 2001 23:54:47 UTC
