RE: Tidy and WordXP

From: Jason Manaigre <jmanaigre@iisd.ca>
Date: Thu, 13 Sep 2001 15:17:34 -0500
Message-ID: <89CD4E678F22EA44ABCDE49C8010176F167B57@marathon.iisd.ca>
To: <html-tidy@w3.org>

Hi guys, yeah, I pasted a file exported from Word XP into an .htm file.

We deal with lots of documents like this, and it's nice to export simple
files and clean them up, but Word XP is doing something nasty...

The error I get is:

Error in document has prevented cleanup:

Tidy (vers 4th August 2000) Parsing "input.html"
line 1 column 1 - Warning: unknown attribute "xmlns:st1"
line 1 column 1 - Warning: unknown attribute "xmlns:dt"
line 1 column 1 - Warning: unknown attribute "xmlns:w"
line 1 column 1 - Warning: unknown attribute "xmlns:o"
line 1 column 1 - Warning: unknown attribute "xmlns:v"
line 24 column 1 - Error: o:smarttagtype is not recognized!
line 24 column 1 - Warning: discarding unexpected o:smarttagtype
line 26 column 1 - Error: o:smarttagtype is not recognized!
line 26 column 1 - Warning: discarding unexpected o:smarttagtype
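
Both errors point at the two <o:SmartTagType .../> declarations in the head
of the sample below, which this Tidy build does not recognize. One possible
workaround is to strip those declarations before handing the file to Tidy.
What follows is only a minimal sketch of that idea in Python, not something
Tidy itself provides: the function name strip_smart_tag_decls and the regular
expression are illustrative assumptions, and the pattern only covers the
self-closing form seen in this sample.

```python
import re

# Assumption: smart tag declarations always appear as self-closing
# <o:SmartTagType ... /> elements, possibly wrapped across several lines,
# as they do in the sample file. Other forms are not handled.
SMART_TAG_DECL = re.compile(r"<o:SmartTagType\b[^>]*/>\s*", re.IGNORECASE)


def strip_smart_tag_decls(html: str) -> str:
    """Return html with every <o:SmartTagType .../> declaration removed."""
    return SMART_TAG_DECL.sub("", html)


if __name__ == "__main__":
    # A fragment of the head from the sample, followed by one body line.
    sample = (
        "<title>Corporate Reporting Page</title>\n"
        "<o:SmartTagType\n"
        'namespaceuri="urn:schemas-microsoft-com:office:smarttags"\n'
        ' name="place"/>\n'
        "<o:SmartTagType\n"
        'namespaceuri="urn:schemas-microsoft-com:office:smarttags"\n'
        ' name="country-region"/>\n'
        "<p class=MsoNormal>Corporate Reporting Page</p>\n"
    )
    print(strip_smart_tag_decls(sample))
```

The sample has no st1:* elements in its body, so removing the two
declarations is enough to get rid of the elements named in the errors. A
document that actually uses smart tags in its text would need more than this.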

I tried grabbing a Win32 build and got an error page. I will try again
later.

Thanks for the help..

-----Original Message-----
From: Reitzel, Charlie [mailto:CReitzel@arrakisplanet.com] 
Sent: Thursday, September 13, 2001 3:08 PM
To: Jason Manaigre; html-tidy@w3.org
Subject: RE: Tidy and WordXP


Hi Jason,

Can you post a sample file?  I don't know what the WordXP HTML output
looks like.  FYI, we have fixed a couple of bugs and generally made
Word2000 cleanup more robust.  You can grab a binary at
http://tidy.sourceforge.net.


_________________________________

<html xmlns:v="urn:schemas-microsoft-com:vml"
xmlns:o="urn:schemas-microsoft-com:office:office"
xmlns:w="urn:schemas-microsoft-com:office:word"
xmlns:dt="uuid:C2F41010-65B3-11d1-A29F-00AA00C14882"
xmlns:st1="urn:schemas-microsoft-com:office:smarttags"
xmlns="http://www.w3.org/TR/REC-html40">

<head>
<meta http-equiv=Content-Type content="text/html; charset=windows-1252">
<meta name=ProgId content=Word.Document>
<meta name=Generator content="Microsoft Word 10">
<meta name=Originator content="Microsoft Word 10">
<link rel=File-List
href="Corporate%20Reporting%20Page2_files/filelist.xml">
<link rel=Edit-Time-Data
href="Corporate%20Reporting%20Page2_files/editdata.mso">
<!--[if !mso]>
<style>
v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style>
<![endif]-->
<title>Corporate Reporting Page</title>
<o:SmartTagType
namespaceuri="urn:schemas-microsoft-com:office:smarttags"
 name="place"/>
<o:SmartTagType
namespaceuri="urn:schemas-microsoft-com:office:smarttags"
 name="country-region"/>
<!--[if gte mso 9]><xml>
 <o:DocumentProperties>
  <o:Author>sara hollett</o:Author>
  <o:LastAuthor>jmanaigre</o:LastAuthor>
  <o:Revision>2</o:Revision>
  <o:TotalTime>116</o:TotalTime>
  <o:Created>2001-09-13T20:12:00Z</o:Created>
  <o:LastSaved>2001-09-13T20:12:00Z</o:LastSaved>
  <o:Pages>1</o:Pages>
  <o:Words>1267</o:Words>
  <o:Characters>7228</o:Characters>
  <o:Company>iisd</o:Company>
  <o:Lines>60</o:Lines>
  <o:Paragraphs>16</o:Paragraphs>
  <o:CharactersWithSpaces>8479</o:CharactersWithSpaces>
  <o:Version>10.2625</o:Version>
 </o:DocumentProperties>
 <o:CustomDocumentProperties>
  <o:_AdHocReviewCycleID
dt:dt="float">1001166697</o:_AdHocReviewCycleID>
  <o:_EmailSubject dt:dt="string">BSD Update</o:_EmailSubject>
  <o:_AuthorEmail dt:dt="string">shollett@iisd.ca</o:_AuthorEmail>
  <o:_AuthorEmailDisplayName dt:dt="string">Sara
Hollett</o:_AuthorEmailDisplayName>
  <o:_ReviewingToolsShownOnce
dt:dt="string"></o:_ReviewingToolsShownOnce>
 </o:CustomDocumentProperties>
</xml><![endif]--><!--[if gte mso 9]><xml>
 <w:WordDocument>
  <w:Compatibility>
   <w:BreakWrappedTables/>
   <w:SnapToGridInCell/>
   <w:WrapTextWithPunct/>
   <w:UseAsianBreakRules/>
  </w:Compatibility>
  <w:BrowserLevel>MicrosoftInternetExplorer4</w:BrowserLevel>
 </w:WordDocument>
</xml><![endif]--><!--[if !mso]><object
 classid="clsid:38481807-CA0E-42D2-BF39-B33AF135CC4D"
id=ieooui></object>
<style>
st1\:*{behavior:url(#ieooui) }
</style>
<![endif]-->

<!--[if gte mso 10]>
<style>
 /* Style Definitions */
 table.MsoNormalTable
	{mso-style-name:"Table Normal";
	mso-tstyle-rowband-size:0;
	mso-tstyle-colband-size:0;
	mso-style-noshow:yes;
	mso-style-parent:"";
	mso-padding-alt:0in 5.4pt 0in 5.4pt;
	mso-para-margin:0in;
	mso-para-margin-bottom:.0001pt;
	mso-pagination:widow-orphan;
	font-size:10.0pt;
	font-family:"Times New Roman";}
</style>
<![endif]--><!--[if gte mso 9]><xml>
 <o:shapedefaults v:ext="edit" spidmax="2050"/>
</xml><![endif]--><!--[if gte mso 9]><xml>
 <o:shapelayout v:ext="edit">
  <o:idmap v:ext="edit" data="1"/>
 </o:shapelayout></xml><![endif]-->
</head>

<body lang=EN-US link=blue vlink=purple style='tab-interval:.5in'>

<div class=Section1>

<p class=MsoNormal>Corporate Reporting Page</p>

<p class=MsoNormal><o:p>&nbsp;</o:p></p>

<p class=MsoNormal><a
href="http://test.iisd.ca/business/corpreport.htm">http://test.iisd.ca/business/corpreport.htm</a></p>

<p class=MsoNormal><o:p>&nbsp;</o:p></p>

<p class=MsoNormal><o:p>&nbsp;</o:p></p>


</div>

</body>

</html>
Received on Thursday, 13 September 2001 16:17:36 GMT
