A popular OCR program is ABBYY FineReader, which can save its output as HTML. Unfortunately, it has the bad habit of adding a <br /> tag to the end of every paragraph, e.g.:

<p>This is an OCR'd paragraph.<br /></p>

This additional <br /> tag doesn't seem to have any effect on the presentation in any of the User Agents I have tested it with. May I modify the Tidy core code to drop the <br /> tag when it appears at the end of a paragraph?

Received on Thursday, 10 October 2002 14:18:19 GMT
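
For illustration only, the cleanup asked about above can be sketched as a post-processing step outside of Tidy. This is not Tidy core code and says nothing about how Tidy would implement it; the function name strip_trailing_br and the regex approach are assumptions made for this sketch, and it only handles plain <br>, <br/> and <br /> tags without attributes.

```python
import re

# Assumption: a regex pass is good enough for FineReader's simple output.
# Matches one or more attribute-free <br> tags (any case), plus surrounding
# whitespace, only when they sit directly before a closing </p> tag.
TRAILING_BR = re.compile(r"(?:\s*<br\s*/?>)+\s*(?=</p\s*>)", re.IGNORECASE)


def strip_trailing_br(html: str) -> str:
    """Remove <br> tags that appear at the very end of a paragraph.

    Hypothetical helper for this example; <br> tags in the middle of a
    paragraph are left untouched.
    """
    return TRAILING_BR.sub("", html)


if __name__ == "__main__":
    sample = "<p>This is an OCR'd paragraph.<br /></p>"
    print(strip_trailing_br(sample))
```

A line break in the middle of a paragraph, such as <p>one<br />two</p>, is left alone, because the pattern requires the closing </p> to follow immediately.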