W3C home > Mailing lists > Public > whatwg@whatwg.org > May 2009

[whatwg] converting word (was <code> attributes

From: Bruce Lawson <brucel@opera.com>
Date: Fri, 01 May 2009 12:27:24 +0100
Message-ID: <op.us80ryrch8on37@bruce-pc>
On Fri, 01 May 2009 12:22:32 +0100, Adrian Sutton  
<adrian.sutton at ephox.com> wrote:


> The biggest challenge in this is actually removing the huge amount of  
> inline
> formatting and proprietary tags/attributes that Microsoft Word adds.  In  
> the
> latest versions it's also a challenge to put lists back together as  
> actual
> HTML lists since Word has started exporting them as paragraphs with a  
> bullet
> from the symbol font and lots of nbsps.

Off topic, I know - but couldn't a VBA macro hook into word and actually  
make an "export as semantic html" option that exported the heading levels  
as h1..h6, honoured bold, italics, links, bullets and numbers as ul and  
ol, and just ignored all colours, font changes etc. So there is nothing to  
clean up?

bruce
Received on Friday, 1 May 2009 04:27:24 UTC

This archive was generated by hypermail 2.3.1 : Monday, 13 April 2015 23:08:48 UTC