W3C home > Mailing lists > Public > www-style@w3.org > May 2011

RE: [css3-text]   as whitespace WAS White space defined

From: Belov, Charles <Charles.Belov@sfmta.com>
Date: Wed, 18 May 2011 10:04:04 -0700
Message-ID: <E17F75B6E86AE842A57B4534F82D0376CD29A8@MTAMAIL.muni.sfgov.org>
To: "www-style list" <www-style@w3.org>
Corrected to add back missing link inadvertently omitted and
clarify a real-world use case. 

> -----Original Message-----
Charles Belov wrote on May 09, 2011 11:28 AM
text-decoration-skip [1]
> defines white space as including nbsp (U+00A0).  
White space rules [2] 
> defines white space as not including nbsp.
> Word-to-filtered-HTML conversion tends to include a lot of 
> extra nbsp's that I don't want
due to kludges that document creators made to obtain particular
formatting in the Word doc that are not appropriate when turned
into HTML.  (That is, the online website style is not intended
to keep the Word formatting.)

For example, the Word document creator may insert extra tabs
to make certain content indent, rather than using Word's
automatic indent feature.  The extra tabs get turned into 
nbsp's during conversion to HTML.

While, ideally, Word document creators would be trained 
to use Word's automatic indent feature, it's not likely to 

>  I would like to be able to 
> specify that bikeshedding also included nbsp's, or, if you 
> wanted to make it more generic, the ability to specify a list 
> of characters I also wanted bikeshedding to apply to.
> Yes, I know ideally removal of extra nbsp's from the source 
> document is something that I would do, but it is currently a 
> manual process and time-consuming.  If nbsp is good enough 
> for text-decoration-skip, why not make it available to other 
> properties?
[1] http://www.w3.org/TR/css3-text/#text-decoration-skip0
[2] http://www.w3.org/TR/css3-text/#white-space-rules

Hope this helps,
Charles Belov
SFMTA Webmaster
Received on Wednesday, 18 May 2011 17:28:04 UTC

This archive was generated by hypermail 2.3.1 : Monday, 2 May 2016 14:38:46 UTC