
Re: Tidy- char encoding problem

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sat, 14 Apr 2007 17:17:32 +0200
To: <chudziu@op.pl>
Cc: <html-tidy@w3.org>
Message-ID: <02s123hp9sjenebgs6s421c321l7co5htl@hive.bjoern.hoehrmann.de>

* <chudziu@op.pl> wrote:
>but when I parse my text, the output file has Big5 encoding,
>even though the input text is ISO-encoded. How can I change that? How can
>I tell Tidy to use ISO-8859-2 (Latin-2) encoding? Does Tidy support it?

I doubt what you get is Big5, since you didn't specify that anywhere.
Tidy does not support ISO-8859-2; you have to use some other library
or custom code to convert the input to, say, UTF-8 first, and then
convert Tidy's output back to ISO-8859-2 afterwards.
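
A minimal sketch of that round trip in Python, using only the standard
codecs. The function names and the run_tidy parameter are made up for
illustration; run_tidy stands for whatever way you invoke Tidy on UTF-8
bytes and is not part of Tidy itself.

```python
from typing import Callable


def latin2_to_utf8(data: bytes) -> bytes:
    """Re-encode ISO-8859-2 bytes as UTF-8."""
    return data.decode("iso-8859-2").encode("utf-8")


def utf8_to_latin2(data: bytes) -> bytes:
    """Re-encode UTF-8 bytes as ISO-8859-2.

    Characters that do not exist in ISO-8859-2 are written as numeric
    character references (&#NNNN;), which HTML parsers understand.
    """
    return data.decode("utf-8").encode("iso-8859-2", errors="xmlcharrefreplace")


def tidy_latin2(data: bytes, run_tidy: Callable[[bytes], bytes]) -> bytes:
    """Convert to UTF-8, run the cleanup step, convert back.

    run_tidy is a hypothetical callable (an assumption of this sketch):
    it takes UTF-8 bytes and returns UTF-8 bytes.
    """
    return utf8_to_latin2(run_tidy(latin2_to_utf8(data)))


if __name__ == "__main__":
    sample = "Zażółć gęślą jaźń".encode("iso-8859-2")
    # The identity function stands in for the real Tidy call here.
    assert tidy_latin2(sample, lambda b: b) == sample
```

Tidy itself then has to be told to read and write UTF-8 in whatever
way you call it.
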
Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
Weinh. Str. 22 · Telefon: +49(0)621/4309674 · http://www.bjoernsworld.de
68309 Mannheim · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 
Received on Saturday, 14 April 2007 15:17:35 UTC
