- From: Bjoern Hoehrmann <derhoermi@gmx.net>
- Date: Sun, 14 Jan 2001 19:34:33 +0100
- To: html-tidy@w3.org
Hi, If I've got a XML document that includes e.g. an UTF-8 encoded, and i tidy it up with -xml I get the HTML entity instead of the korrekt UTF-8 sequence. The XML document isn't wellformed any longer and therefore unusable. Using the -utf8 command line argument doesn't change this behaivour. Tidy must include something like <!DOCTYPE html [ <!ENTITY nbsp " "> ]> or use UTF-8 encoding or numeric character references when in XML mode. -- Björn Höhrmann ^ mailto:bjoern@hoehrmann.de ^ http://www.bjoernsworld.de am Badedeich 7 ° Telefon: +49(0)4667/981028 ° http://bjoern.hoehrmann.de 25899 Dagebüll # PGP Pub. KeyID: 0xA4357E78 # http://learn.to/quote [!]e <x>I ♥ ♀, ♫ & XML ☺</x>
Received on Sunday, 14 January 2001 13:33:55 UTC