W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2008

Re: tidy fails parsing broken tag

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sun, 11 May 2008 14:19:57 +0200
To: Arvid Ephraim Picciani <aep@ibcsolutions.de>
Cc: html-tidy@w3.org
Message-ID: <c1pd2452m7nksr7i8vhllrfrpnp53de4ki@hive.bjoern.hoehrmann.de>

* Arvid Ephraim Picciani wrote:
>on some website i found:
>"<bobr>foobar</nobr>"
>
>tidy won't parse a bit of the entire page, any idea how to fix this?  i don't 
>need the correct tags, just the content.
>
>tidy is able to correct:
>"<nobr>foobar</bobr>"
>into
>"<nobr>foobar</nobr>"
>but not the other way round.
>
>I need perfectly valid xml at all costs, so i'm fine if some parts get lost.

Perhaps you are looking for the --force-output option, or maybe even one
of the --new-foo-tags options. The former will give you output even if
it is broken, the latter allows you to declare "bobr" as new element.
-- 
Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
Weinh. Str. 22 · Telefon: +49(0)621/4309674 · http://www.bjoernsworld.de
68309 Mannheim · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 
Received on Sunday, 11 May 2008 12:20:54 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:59 GMT