W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2000

RE: tidy with MSXML parser

From: Merkey, Brett <Brett_Merkey@tvratings.com>
Date: Wed, 31 May 2000 13:05:01 -0400
Message-ID: <01D4D419B1A4D111A30400805FE65B13065FE851@nmrusdunsx1.nielsenmedia.com>
To: "'Eli'" <Eli@wapcominc.com>, "'html-tidy@w3.org'" <html-tidy@w3.org>
Sounds like the errors found are proper. For instance:

<link type="text/css" disabled rel="stylesheet" href=
"stylesheets/msdn_ie3.css"> 
	caused the error: Missing equals sign between attribute and
attribute value 

You have to create XHTML legal syntax. In this case, adding
disabled="disabled" is necessary.

Most of the others come from not ending META tags with /> (blank space,
forward slash, angle.

--Brett



-----Original Message-----
From: Eli [mailto:Eli@wapcominc.com]
Sent: Wednesday, May 31, 2000 1:26 PM
To: 'html-tidy@w3.org'
Subject: tidy with MSXML parser


Hi all.

I am pretty new in the xml scope. 

I have made a very simple experiment:

1) Took an HTML page form MSDN help 
2 )clicked: "view source" got the source into a notepad document.
3) sent this HTML to tidy - no changes in the default config parameters.
4) Got a "tidy converted" page which should be compliance to the XML
standard.
   (the "tidy" tag was into the page - which means that dity has been
working on it , no significant errors in the errors.txt )
5) run a very simple VB program which loads an HTMLpage into a DOM
object 
6) here is the most significant code:

Dim xmlDoc As New MSXML.DOMDocument
xmlDoc.Load ("c:\tidy\msdn_help.html) 

7) the MSXML parser failed while loading this page. Some of the errors
were reasonable and easy to fix but the other (see below) caused me a
real problem.
9) Here are the main errors:

	Line like this                             <link type="text/css"
disabled rel="stylesheet" href= "stylesheets/msdn_ie3.css"> 
	caused the error: 		Missing equals sign between
attribute and attribute value 


	Every line like this:                     <meta name= "( any
string. e.g)"> 
	caused the error:                      End tag (something. e.g.
) does not match the start tag 'meta'


	Every line like this:                     <li>Click the
<b>Options</b> button.<br>
					<br>
				</li> 
	caused the error:                     End tag 'li' does not
match the start tag 'br'  

	Every line like this :                  the Msjaval.dll as the
control&resqu server, with the name and path... 
	caused the error:                    reference to undefined
resqu 

	I also got errors like this :         A string literal was
expected, but no opening quote character was found 

	few tags were missing in the end of the file, (somewhere there
was  a <tag> without </tag>...)

8) After fixing all the errors the page was successfully loaded to the
parser.



My question is: is tidy and MSXML are compliance to the same standard
(XML 1.0 ) ? Is this should be possible to load an HTML page after
working with tidy on it?
Did I something wrong ?


Any Idea will help me a lot.


Thanks in advance 


Eli.
 WAPCOM incorporation
Received on Wednesday, 31 May 2000 13:05:55 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:43 GMT