
RE: error when clean html with tidy (fwd)

From: Chunbo Shao <cxs0187@omega.uta.edu>
Date: Fri, 24 Aug 2001 01:17:03 -0500 (CDT)
To: "Reitzel, Charlie" <CReitzel@arrakisplanet.com>
cc: html-tidy@w3.org
Message-ID: <Pine.OSF.4.10.10108240110390.18703-300000@omega.uta.edu>
Thanks for the reply.

The attached files are HTML pages from some university sites. I didn't
make the pages; I am just using Tidy to parse them.

The file "48-washington.edu" gave the error "Error: <meta> missing '>' for
end of tag".

The file "42-upenn.edu" gave me the error "Error: <a> missing '>' for end
of tag".

At the beginning of each file you can see the URL of the page it contains.
I already took these extra lines out before running Tidy on the file.

I cannot see any clue as to why the error happens.
Thanks for your help.
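
One way to look for the unbalanced quotes suggested in the reply quoted
below is to scan each tag and report those whose quoted attribute value is
still open when the next "<" or the end of the file is reached. The
following is only a rough sketch under that assumption: the class
QuoteCheck and its method findUnbalancedTags are hypothetical names, not
part of Tidy, and the heuristic does not reproduce Tidy's own lexer, so it
can report false positives (for example a "<" inside a script).

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Tidy: flags tags that never reach
// their closing '>' because a quote is left open or another tag starts.
class QuoteCheck {

    // Returns the first 60 characters (or less) of each suspicious tag.
    static List<String> findUnbalancedTags(String html) {
        List<String> suspects = new ArrayList<>();
        int i = 0;
        while (i < html.length()) {
            char c = html.charAt(i);
            // A tag starts with '<' followed by a letter or '/'.
            boolean startsTag = c == '<' && i + 1 < html.length()
                    && (Character.isLetter(html.charAt(i + 1))
                        || html.charAt(i + 1) == '/');
            if (!startsTag) {
                i++;
                continue;
            }
            int start = i;
            char quote = 0;        // 0 means "not inside a quoted value"
            boolean closed = false;
            i++;
            while (i < html.length()) {
                char d = html.charAt(i);
                if (quote == 0) {
                    if (d == '"' || d == '\'') {
                        quote = d;             // quoted value begins
                    } else if (d == '>') {
                        closed = true;         // normal end of tag
                        i++;
                        break;
                    } else if (d == '<') {
                        break;                 // next tag began too early
                    }
                } else if (d == quote) {
                    quote = 0;                 // quoted value ends
                } else if (d == '<') {
                    break;                     // quote still open at next tag
                }
                i++;
            }
            if (!closed) {
                int end = Math.min(html.length(), start + 60);
                suspects.add(html.substring(start, end));
            }
        }
        return suspects;
    }

    // Prints the suspicious tags found in each file named on the command line.
    public static void main(String[] args) throws IOException {
        for (String name : args) {
            byte[] bytes = Files.readAllBytes(Paths.get(name));
            String html = new String(bytes, StandardCharsets.ISO_8859_1);
            for (String tag : findUnbalancedTags(html)) {
                System.out.println(name + ": " + tag);
            }
        }
    }
}
```

Running it on the two attached files (for example
"java QuoteCheck 48-washington.edu 42-upenn.edu") would list the candidate
<meta> and <a> tags to inspect by hand.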

chunbo


On Thu, 23 Aug 2001, Reitzel, Charlie wrote:

> Can you send a snippet of your HTML w/ the <meta> and <a> tags that Tidy is
> complaining about?  You may have unbalanced quotes or some other problem that
> has confused it.
> 
> -----Original Message-----
> From: Chunbo Shao [mailto:cxs0187@omega.uta.edu]
> Sent: Thursday, August 23, 2001 5:43 PM
> To: mrbannon@student.math.uwaterloo.ca
> Cc: html-tidy@w3.org
> Subject: error when clean html with tidy (fwd)
> 
> 
> Hi, 
> 
> Almost the same thing: the error shows
> "<meta> missing '>' for end of tag".
> 
> But "meta" is already in TagTable.java.
> 
> Can we do something (to Tidy) to solve this problem, so that it gives
> proper output rather than a zero-content file?
> 
> thanks.
> 
> Chunbo
> 
> 
> ---------- Forwarded message ----------
> Date: Thu, 23 Aug 2001 16:23:43 -0500 (CDT)
> From: Chunbo Shao <cxs0187@omega.uta.edu>
> To: Michael Ryan Bannon <mrbannon@student.math.uwaterloo.ca>
> Cc: html-tidy@w3.org
> Subject: error when clean html with tidy
> 
> Hi, Michael
> 
> Thanks for your help on "config.txt". It's a good solution.
> 
> When I run Tidy to clean some HTML, I get one error indicating that
> "<a> missing '>' for end of tag". But "<a>" is already included in
> TagTable.java.
> Because of this error, the cleaned output is a zero-length file.
> But I want the output file not to be a zero-content file.
> 
> Is there any solution to avoid this? Tidy is supposed to cope with this
> case, isn't it?
> 
> chunbo
> 
> 
> 


Received on Friday, 24 August 2001 02:17:26 GMT