Tidy does fancy entity replacements, but it shouldn't

From: Timo Hummel <timo.hummel@4fb.de>
Date: Mon, 21 Nov 2005 14:20:25 +0100
Message-ID: <63520597EA0F064392752B01D425188C023888@droop.headoffice.4fb.de>
To: <html-tidy@w3.org>

Hi everybody,

I'm using tidy to clean up partial HTML documents. However, tidy messes up some entities where it shouldn't. Example:

This is a testcase where I need &auml; to stay &auml;, and ä should stay ä

I played around with the tidy options, but either &auml; is converted to ä, ä is converted to &auml;, or both &auml; and ä are converted to &#228;.
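
To make the three outcomes concrete, here is a minimal Python sketch that reproduces each conversion on the test string. It uses only the Python standard library and is not Tidy itself; the function names and the SAMPLE string are made up for illustration, and the sketch assumes the raw character in question is ä.

```python
import html
from html.entities import codepoint2name

# Hypothetical test string: one named entity and one raw character.
SAMPLE = "I need &auml; to stay &auml; and ä to stay ä"


def entities_to_chars(text):
    """Outcome 1: named entities become raw characters."""
    # Note: html.unescape also decodes &amp;, &lt; etc., which is fine
    # for this sample but would not be safe on arbitrary markup.
    return html.unescape(text)


def chars_to_named(text):
    """Outcome 2: raw non-ASCII characters become named entities."""
    out = []
    for ch in text:
        code = ord(ch)
        if code > 127 and code in codepoint2name:
            out.append("&" + codepoint2name[code] + ";")
        else:
            out.append(ch)
    return "".join(out)


def all_to_numeric(text):
    """Outcome 3: both forms become numeric character references."""
    decoded = html.unescape(text)
    return "".join("&#%d;" % ord(ch) if ord(ch) > 127 else ch for ch in decoded)


def leave_untouched(text):
    """The behaviour being asked for: entities pass through unchanged."""
    return text


if __name__ == "__main__":
    for fn in (entities_to_chars, chars_to_named, all_to_numeric, leave_untouched):
        print(fn.__name__, "->", fn(SAMPLE))
```

In each of the first three functions the distinction between &auml; and ä in the input is lost, which is the problem; only the last one keeps both forms as written.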

Is there an option to ignore entities? I just need to clean up broken elements (e.g. incorrect nesting).

With best regards,
Timo A. Hummel

----
four for business AG

Lilistrasse 83/C  |  63067 Offenbach
phone: +49 69 801082-0  |  fax: +49 69 801082-79
mail: timo.hummel@4fb.de  |  web: http://www.4fb.de	
Received on Monday, 21 November 2005 15:03:40 GMT