W3C home > Mailing lists > Public > html-tidy@w3.org > July to September 2002

Re: non ascii characters transformed to whatever

From: Charles Reitzel <creitzel@rcn.com>
Date: Thu, 11 Jul 2002 10:43:43 -0400
Message-Id: <4.3.2.7.2.20020711103849.02faf200@pop.rcn.com>
To: Fernando Nuņez <fernando.nunez@pildo.com>
Cc: <html-tidy@w3.org>

Hi Fernando,

Did you try -latin1 to specify you have extended chars?  If the characters 
are Latin1 (ISO-8859-1) symbols, then Tidy should read the raw characters 
and emit nice HTML entities (either numeric or symbolic).

If your symbol characters are in some other encoding (Windows-1252? Mac 
Roman?), you may still be able to get Tidy to recognize them.

See http://tidy.sf.net/docs/quickref.html#char-encoding

If that doesn't work, send us a small sample.

take it easy,
Charlie

At 06:32 PM 7/8/2002 -0400, Fernando Nuņez wrote:




>Hi all,
>First contribution/question.....
>When I run tidy on some HTML pages which contain non ascii characters
>(science equations, basically), they are transformed to a string of
>strange characters (a sequence of non ascii and ascii characters). As
>this is only one step into a, in theory, automatic process, we cannot
>afford re-changing it by hand.
>Does any of you happened to know whether it is possible to "tell" tidy
>not to transform this kind of characters.
>It is really driving me crazy....
>Thanks in advance
>Fernando
Received on Thursday, 11 July 2002 10:38:41 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:52 GMT