W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2000

RE: tidy with MSXML parser

From: Julian Reschke <reschke@medicaldataservice.de>
Date: Wed, 31 May 2000 18:42:15 +0200
To: "Eli" <Eli@wapcominc.com>, <html-tidy@w3.org>
Message-ID: <NCBBIPMOPKLLGKJPBINCMEGEEFAA.reschke@medicaldataservice.de>
Hi,

to get a compliant XML document, you need to specify "-asxml" and
"-numeric".

Julian

> -----Original Message-----
> From: html-tidy-request@w3.org [mailto:html-tidy-request@w3.org]On
> Behalf Of Eli
> Sent: Wednesday, May 31, 2000 7:26 PM
> To: 'html-tidy@w3.org'
> Subject: tidy with MSXML parser
>
>
> Hi all.
>
> I am pretty new in the xml scope.
>
> I have made a very simple experiment:
>
> 1) Took an HTML page form MSDN help
> 2 )clicked: "view source" got the source into a notepad document.
> 3) sent this HTML to tidy - no changes in the default config parameters.
> 4) Got a "tidy converted" page which should be compliance to the XML
> standard.
>    (the "tidy" tag was into the page - which means that dity has been
> working on it , no significant errors in the errors.txt )
> 5) run a very simple VB program which loads an HTMLpage into a DOM
> object
> 6) here is the most significant code:
>
> Dim xmlDoc As New MSXML.DOMDocument
> xmlDoc.Load ("c:\tidy\msdn_help.html)
>
> 7) the MSXML parser failed while loading this page. Some of the errors
> were reasonable and easy to fix but the other (see below) caused me a
> real problem.
> 9) Here are the main errors:
>
> 	Line like this                             <link type="text/css"
> disabled rel="stylesheet" href= "stylesheets/msdn_ie3.css">
> 	caused the error: 		Missing equals sign between
> attribute and attribute value
>
>
> 	Every line like this:                     <meta name= "( any
> string. e.g)">
> 	caused the error:                      End tag (something. e.g.
> ) does not match the start tag 'meta'
>
>
> 	Every line like this:                     <li>Click the
> <b>Options</b> button.<br>
> 					<br>
> 				</li>
> 	caused the error:                     End tag 'li' does not
> match the start tag 'br'
>
> 	Every line like this :                  the Msjaval.dll as the
> control&resqu server, with the name and path...
> 	caused the error:                    reference to undefined
> resqu
>
> 	I also got errors like this :         A string literal was
> expected, but no opening quote character was found
>
> 	few tags were missing in the end of the file, (somewhere there
> was  a <tag> without </tag>...)
>
> 8) After fixing all the errors the page was successfully loaded to the
> parser.
>
>
>
> My question is: is tidy and MSXML are compliance to the same standard
> (XML 1.0 ) ? Is this should be possible to load an HTML page after
> working with tidy on it?
> Did I something wrong ?
>
>
> Any Idea will help me a lot.
>
>
> Thanks in advance
>
>
> Eli.
>  WAPCOM incorporation
>
>
>
>
>
>
Received on Wednesday, 31 May 2000 12:42:17 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:43 GMT