Re: Break tags in paragraphs.

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Fri, 11 Oct 2002 02:35:56 +0200
To: Lee Passey <lee@dysfunctionals.org>
Cc: "tidy-develop@lists.sourceforge.net" <tidy-develop@lists.sourceforge.net>, Tidy Main List <html-tidy@w3.org>
Message-ID: <3dca1b5f.100294255@smtp.bjoern.hoehrmann.de>

* Lee Passey wrote:

*Please* never ever cross-post between html-tidy@w3.org and
tidy-develop@lists.sf.net.

>A popular OCR program is ABBYY FineReader, which can save its output as 
>HTML.  Unfortunately, it has the bad habit of adding a <br /> tag to the 
>end of every paragraph, e.g.:
>
><p>This is an OCR'd paragraph.<br /></p>
>
>This additional <br /> tag doesn't seem to have any effect on the 
>presentation in any of the User Agents I have tested it with.

Most of the time it does not, yes.

>May I modify the Tidy core code to drop the <br /> tag when it appears 
>at the end of a paragraph?

We have a feature request by Claus Färber for this and other
<br>-related issues; see

  https://sourceforge.net/tracker/index.php?func=detail&aid=611171&group_id=27659&atid=390966

I think a special case for just <p>...<br /></p> is not enough. We can
remove the <br /> at the end of all block-level elements, and we should,
as Claus proposes, move <br />s outside inline elements when they occur
just before the inline element's closing tag.

So, in general I agree with removing <br />s at such places, but I'd
like to see a better thought-out solution than just this <p><br> special
case.
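
To make the more general rule concrete, here is a rough sketch in Python.
It is not Tidy code and does not use Tidy's data structures: Node, BLOCK,
INLINE, tidy_trailing_br and render are hypothetical names invented for
the illustration, and the element lists are deliberately incomplete. It
assumes that only a single trailing <br /> is dropped from a block-level
element, and that a run of two or more is left alone because that may
well affect rendering.

```python
from dataclasses import dataclass, field
from typing import List, Union

# Illustrative, incomplete element sets (assumptions, not Tidy's tables).
BLOCK = {"p", "div", "li", "td", "th", "blockquote", "h1", "h2", "h3"}
INLINE = {"a", "b", "i", "em", "strong", "span", "font"}


@dataclass
class Node:
    """Toy element node; children are child nodes or plain text strings."""
    tag: str
    children: List[Union["Node", str]] = field(default_factory=list)


def is_br(item) -> bool:
    return isinstance(item, Node) and item.tag == "br"


def tidy_trailing_br(node: Node) -> Node:
    """Hoist trailing <br /> out of inline children, then drop a single
    trailing <br /> if this node is a block-level element."""
    rebuilt = []
    for child in node.children:
        rebuilt.append(child)
        if isinstance(child, Node):
            tidy_trailing_br(child)  # clean up the subtree first
            if child.tag in INLINE:
                # Move <br /> found just before the inline element's
                # closing tag to just after that closing tag.
                while child.children and is_br(child.children[-1]):
                    rebuilt.append(child.children.pop())
    node.children = rebuilt

    if node.tag in BLOCK and node.children and is_br(node.children[-1]):
        # Only a lone trailing break is removed; a run is kept as-is.
        if len(node.children) < 2 or not is_br(node.children[-2]):
            node.children.pop()
    return node


def render(item) -> str:
    """Serialize the toy tree back to markup for inspection."""
    if isinstance(item, str):
        return item
    if item.tag == "br":
        return "<br />"
    inner = "".join(render(c) for c in item.children)
    return f"<{item.tag}>{inner}</{item.tag}>"


if __name__ == "__main__":
    para = Node("p", ["This is an OCR'd paragraph.", Node("br")])
    print(render(tidy_trailing_br(para)))
```

Hoisting happens before the block-level check, so a <br /> nested inside
several inline elements at the end of a paragraph bubbles up and is then
dropped, while a <br /> in the middle of the text is left untouched.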

Please follow up to tidy-develop@lists.sf.net only!

regards.
Received on Thursday, 10 October 2002 20:35:49 UTC
