
Re: error when clean html with tidy (fwd)

From: Gary L Peskin <garyp@firstech.com>
Date: Thu, 23 Aug 2001 23:28:50 -0700
Message-ID: <3B85F422.9F4481DD@firstech.com>
To: Chunbo Shao <cxs0187@omega.uta.edu>
CC: html-tidy@w3.org
Please show how you're invoking Tidy and what the exact error messages
are.
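
In the meantime, since unbalanced quotes are suggested further down the
thread as a possible cause, a rough pre-check along these lines might help
narrow down which tags to look at. This is only a sketch in plain Java and is
not part of Tidy: the class name QuoteCheck and the method findUnbalancedTags
are made up for illustration. It assumes that a '<' showing up inside a
quoted attribute value, or before the tag's closing '>', means something was
left open, so it can report false positives (for example a literal '<' in
text or in an attribute value), and it ignores comments and scripts.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of Tidy.
class QuoteCheck {

    /**
     * Scans the markup and returns the text of each tag that is interrupted
     * by a new '<' (or by the end of input) before its closing '>' is found
     * outside of a quoted attribute value.
     */
    static List<String> findUnbalancedTags(String html) {
        List<String> suspects = new ArrayList<>();
        int i = 0;
        while (i < html.length()) {
            int start = html.indexOf('<', i);
            if (start < 0) {
                break; // no more tags
            }
            char quote = 0;         // 0 means "not inside a quoted value"
            boolean closed = false; // true once '>' is seen outside quotes
            int j = start + 1;
            for (; j < html.length(); j++) {
                char c = html.charAt(j);
                if (quote != 0) {
                    if (c == quote) {
                        quote = 0;  // end of the quoted value
                    } else if (c == '<') {
                        break;      // new tag inside a quote: likely unbalanced
                    }
                } else if (c == '"' || c == '\'') {
                    quote = c;      // start of a quoted value
                } else if (c == '>') {
                    closed = true;
                    break;
                } else if (c == '<') {
                    break;          // tag was never closed
                }
            }
            if (closed) {
                i = j + 1;          // continue after the closing '>'
            } else {
                suspects.add(html.substring(start, j));
                i = j;              // resume at the interrupting '<' (or end)
            }
        }
        return suspects;
    }
}
```

Running findUnbalancedTags over the contents of each attached file and
printing the returned strings should point at the places worth checking by
hand. An empty list does not prove the markup is fine.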

Thanks,
Gary


Chunbo Shao wrote:
> 
> Thanks for the reply.
> 
> The attached files are HTML pages from some university sites. I didn't
> write the pages; I just use Tidy to parse them.
> 
> The file "48-washington.edu" gave the error "Error: <meta> missing '>' for
> end of tag".
> 
> The file "42-upenn.edu" gave me the error "Error: <a> missing '>' for end
> of tag".
> 
> At the beginning of each file, you can see the URL of the page it was
> saved from. I had already taken out these extra lines before I used Tidy
> to clean the file.
> 
> I cannot see any clue as to why the error happens.
> Thanks for the help.
> 
> chunbo
> 
> On Thu, 23 Aug 2001, Reitzel, Charlie wrote:
> 
> > Can you send a snippet of your HTML w/ the <meta> and <a> tags that Tidy is
> > complaining about?  You may have unbalanced quotes or some other problem that
> > has confused it.
> >
> > -----Original Message-----
> > From: Chunbo Shao [mailto:cxs0187@omega.uta.edu]
> > Sent: Thursday, August 23, 2001 5:43 PM
> > To: mrbannon@student.math.uwaterloo.ca
> > Cc: html-tidy@w3.org
> > Subject: error when clean html with tidy (fwd)
> >
> >
> > Hi,
> >
> > Almost the same thing: the error shows
> > "<meta> missing '>' for end of tag".
> >
> > But "meta" is already in TagTable.java.
> >
> > Can we do something (in Tidy) to solve this problem, so that it gives
> > usable output rather than a zero-content file?
> >
> > thanks.
> >
> > Chunbo
> >
> >
> > ---------- Forwarded message ----------
> > Date: Thu, 23 Aug 2001 16:23:43 -0500 (CDT)
> > From: Chunbo Shao <cxs0187@omega.uta.edu>
> > To: Michael Ryan Bannon <mrbannon@student.math.uwaterloo.ca>
> > Cc: html-tidy@w3.org
> > Subject: error when clean html with tidy
> >
> > Hi, Michael
> >
> > Thanks for your help on "config.txt". It's a good solution.
> >
> > When I run Tidy to clean some HTML, I get an error indicating
> > "<a> missing '>' for end of tag". But "<a>" is already included in
> > TagTable.java.
> > Because of this error, the cleaned output is a zero-length file,
> > but I want the output file not to be empty.
> >
> > Is there any way to avoid this? Tidy is supposed to cope with this
> > case, isn't it?
> >
> > chunbo
> >
> >
> >
> 
>   ------------------------------------------------------------------------
>   Attachments:
>
>       Name: 48-washington.edu
>       Type: Plain Text (TEXT/PLAIN)
>   Encoding: BASE64
>
>       Name: 42-upenn.edu
>       Type: Plain Text (TEXT/PLAIN)
>   Encoding: BASE64
Received on Friday, 24 August 2001 02:29:33 GMT
