- From: Jim Thatcher <jim@jimthatcher.com>
- Date: Tue, 23 Oct 2001 08:19:21 -0500
- To: w3c-wai-ig@w3.org
I fear that this may be diverging from the thread, but the subject is right on. I was converting a large Office document to (accessible and clean) html and very late in the process, someone told me that HTML-Kit had a specific menu option to clean up Word 2000 documents that had been stored in html, "Strip surplus tags in Word 2000 pages." That worked very well. Now from Kathleen Anderson's [kathleen.anderson@po.state.ct.us] CMAC Access list I hear Microsoft has a filter to do the same thing. It is also educational to read why the documents are so bloated. Here's the quote from Kathleen's message which was obviously quoting Microsoft: <QUOTE> http://office.microsoft.com/downloads/2000/Msohtmf2.aspx The Office HTML Filter is a tool you can use to remove Office-specific markup tags embedded in Office 2000 documents saved as Hypertext Markup Language (HTML). When you create an HTML document in Office 2000, Office-specific markup tags are embedded in it. These tags help "round-trip" the document for editing purposes. For example, if you create the document in Word 2000 and save it as HTML, the code embedded in the document allows you to re-open the document in Word 2000 and use the same features you originally used to create the page. Once you have completed editing an HTML document in Word 2000 or Excel 2000, you can use the Office HTML Filter to remove the Office-specific markup tags from the final copy of the HTML document. By removing the tags, you reduce the size of the document, which in turn reduces both the amount of space used on Web servers as well as the time it takes to download the page. For additional information about the benefits and disadvantages of removing the Office-specific markup tags, read Use Office HTML Filter to Create Web Pages that Download Faster. <endquote> Jim jim@jimthatcher.com Accessibility Consulting http://jimthatcher.com 512-306-0931
Received on Tuesday, 23 October 2001 09:21:16 UTC