Tidy uses undefined entities with -xml

Hi,

If I have an XML document that includes, e.g., a no-break space (U+00A0)
encoded as UTF-8, and I tidy it up with -xml, I get the HTML entity &nbsp;
instead of the correct UTF-8 sequence. The XML document is no longer
well-formed and is therefore unusable. Using the -utf8 command line argument
doesn't change this behaviour.

Tidy must either include a declaration such as

<!DOCTYPE html [
  <!ENTITY nbsp "&#160;">
]>

or use UTF-8 encoding or numeric character references when in XML mode.
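
To illustrate the difference between these options, here is a minimal Python
sketch. It does not run Tidy; it only feeds hand-written sample documents to
the standard library XML parser. The helper name is_well_formed and the
sample <p> documents are made up for this illustration.

```python
import xml.etree.ElementTree as ET


def is_well_formed(document):
    """Return True if the XML parser accepts the document, else False."""
    try:
        ET.fromstring(document)
        return True
    except ET.ParseError:
        return False


# The problem case: &nbsp; is used but never declared, which XML forbids.
undeclared = "<p>a&nbsp;b</p>"

# Option 1: declare the entity in an internal DTD subset.
declared = '<!DOCTYPE p [\n  <!ENTITY nbsp "&#160;">\n]>\n<p>a&nbsp;b</p>'

# Option 2: use a numeric character reference.
numeric = "<p>a&#160;b</p>"

# Option 3: write the character itself, encoded as UTF-8 bytes.
literal = "<p>a\u00a0b</p>".encode("utf-8")

for name, doc in [
    ("undeclared entity", undeclared),
    ("declared entity", declared),
    ("numeric reference", numeric),
    ("UTF-8 literal", literal),
]:
    print(name, "->", "well-formed" if is_well_formed(doc) else "NOT well-formed")
```

Only the first sample should be rejected; the other three are forms an XML
parser can read without knowing the HTML entity set.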
-- 
Björn Höhrmann ^ mailto:bjoern@hoehrmann.de ^ http://www.bjoernsworld.de
am Badedeich 7 ° Telefon: +49(0)4667/981028 ° http://bjoern.hoehrmann.de
25899 Dagebüll # PGP Pub. KeyID: 0xA4357E78 # http://learn.to/quote [!]e
<x>&#73; &#x2665; &#x2640;, &#x266B; &#x26; &#88;&#77;&#76; &#x263A;</x>

Received on Sunday, 14 January 2001 13:33:55 UTC