problem with Tidy and some Frontpage html

From: Drew Burton <drb@ams.org>
Date: Tue, 5 Sep 2000 12:26:48 -0400 (EDT)
Message-Id: <200009051626.MAA0000024051@mr1.mr.ams.org>
To: html-tidy@w3.org
Cc: drb@ams.org
First I must say that we greatly appreciate Tidy and, as best as I can
enforce it, we use it to clean up the pages of MathSciNet from the Math

I hesitate to call this a bug, but Tidy mishandles some bad HTML from
FrontPage 4.0.

Tidy throws away a name-anchor when it is represented as:
"<P><A NAME="clipboard"><H3>Clipboard</H3></A><P>"

That becomes "<H3>Clipboard</H3>".
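
One possible workaround, sketched here as an assumption and not something Tidy itself provides, is to preprocess the FrontPage markup so that the name-anchor sits inside the heading instead of around it. An inline A inside a block-level H3 is valid HTML 4 nesting, so Tidy should have no reason to discard it. The function name and the regular expression below are hypothetical, and the pattern only covers the simple case of a double-quoted NAME attribute wrapping a single heading.

```python
import re

# Hypothetical pattern: an anchor that wraps exactly one heading, e.g.
# <A NAME="clipboard"><H3>Clipboard</H3></A>
ANCHOR_AROUND_HEADING = re.compile(
    r'<a\s+name="(?P<name>[^"]*)"\s*>\s*'
    r'<(?P<tag>h[1-6])(?P<attrs>[^>]*)>(?P<body>.*?)</(?P=tag)\s*>\s*'
    r'</a\s*>',
    re.IGNORECASE | re.DOTALL,
)


def move_anchor_inside_heading(html):
    """Rewrite <a name=X><hN>text</hN></a> as <hN><a name=X>text</a></hN>."""

    def swap(match):
        # Keep the heading tag and its attributes as written; re-emit the
        # anchor inside the heading so the nesting is inline-in-block.
        return '<{tag}{attrs}><a name="{name}">{body}</a></{tag}>'.format(
            **match.groupdict()
        )

    return ANCHOR_AROUND_HEADING.sub(swap, html)


if __name__ == "__main__":
    fragment = '<P><A NAME="clipboard"><H3>Clipboard</H3></A><P>'
    print(move_anchor_inside_heading(fragment))
    # prints: <P><H3><a name="clipboard">Clipboard</a></H3><P>
```

The rewritten fragment would then be passed to Tidy as usual. A regular expression is a blunt tool for HTML, so this is only reasonable for machine-generated markup with a predictable shape like the fragment above.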

I'm running as new a version as I could easily find.  For completeness I'm
including a) how it runs, b) the test input file and c) the output file.


   Drew Burton
   Ann Arbor, Michigan
   American Mathematical Society


a) how it runs
Tidy (vers 30th April 2000) Parsing "frontpage.html"
line 10 column 24 - Warning: missing </a> before <h3>
line 10 column 24 - Warning: trimming empty <p>
line 10 column 42 - Warning: discarding unexpected </a>

"frontpage.html" appears to be HTML 4.01 Transitional
3 warnings/errors were found!

b) test input file

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<META NAME="generator" CONTENT="HTML Tidy, see www.w3.org">
<META NAME="generator" CONTENT="Microsoft FrontPage 4.0">
A fragment of htmp created by Frontpage.....
<P><A NAME="clipboard"><H3>Clipboard</H3></A><P>

lots more stuff deleted....


c) output file

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<meta name="generator" content="HTML Tidy, see www.w3.org">
<meta name="generator" content="Microsoft FrontPage 4.0">
A fragment of htmp created by Frontpage..... 


<p>lots more stuff deleted....</p>
Received on Tuesday, 5 September 2000 13:54:40 UTC
