W3C home > Mailing lists > Public > whatwg@whatwg.org > May 2009

[whatwg] converting word (was <code> attributes

From: Charles McCathieNevile <chaals@opera.com>
Date: Fri, 01 May 2009 13:03:04 +0200
Message-ID: <op.us8znexnwxe0ny@widsith.local>
On Thu, 30 Apr 2009 13:25:17 +0200, Kristof Zelechovski  
<giecrilj at stegny.2a.pl> wrote:

> Automatic conversion from Microsoft Word to HTML is doomed to fail  
> because the document models and the requirements are different.
> The best you can get is a tree of DIVs and SPANs with Word-
> specific classes.  Anything better needs a serious and thoughtful
> remake by the editor.

This is an oversimplification to the point of being misleading.

There are many ways to use Word, and many people and organisations with  
haf a clue use it in such a way that automatic conversion can be  
relatively easily used to generate highly semantically rich and valid  
markup - much better than the sort of tripe one typically finds on the Web  

Word enabled the clear semantic markup of documents before HTML even  
existed, and people who use it like that are in a good position to produce  
HTML that is much better quality than most developers come up with. It  
also enables presentation-driven markup to be converted automatically to  
make the underlying semantics explicit (something that virtually no HTML  
tool outside those which convert from another input format manages to do).

This stuff is not rocket science, either. It is the sort of thing that was  
routine more than a decade ago when people started to use HTML as well as,  
or instead of, Word and similar formats.



Charles McCathieNevile  Opera Software, Standards Group
     je parle fran?ais -- hablo espa?ol -- jeg l?rer norsk
http://my.opera.com/chaals       Try Opera: http://www.opera.com
Received on Friday, 1 May 2009 04:03:04 UTC

This archive was generated by hypermail 2.4.0 : Wednesday, 22 January 2020 16:59:12 UTC