W3C home > Mailing lists > Public > html-tidy@w3.org > January to March 2002

keeping \240 out, etc.

From: Dan Jacobson <jidanni@yahoo.com.tw>
Date: 21 Jan 2002 23:37:18 +0800
To: html-tidy@w3.org
Message-ID: <m2hepfzpe9.fsf@Mdan.jacobson.tw>
Please don't put the non ascii for a nbsp, i.e. a (0240, 160, 0xa0)
into the output, as when next to a <, as in <b>, it gets eaten assuming
a Chinese big5 character, in browsers.  Now I must do |tr -d \\240 ...

--bare yes --quote-nbsp yes didn't help with the dec01 version.

By the way, I use --doctype strict and --logical-emphasis and a non
strict <U> still gets thru.

!drop-font-tags
!
!This option specifies if Tidy should discard <FONT> and <CENTER> tags
!rather than creating the corresponding style rules, but only if the
!clean option is also set to yes.

OK, but what if we want to discard them anyway?

Tidy is smart enough to get rid of
<STRONG></STRONG>
but how about
</STRONG><STRONG>
hmm?

I notice that
<A  href="#_ftn2"
    name="_ftnref2"
    title="">[2]</A>
keeps title="" for some reason.

Is
--escape-cdata yes 
supposed to relieve me from doing
sed 's/<!\[if !supportFootnotes]>\(\[[0-9]\+\]\)<!\[endif]>/\1/g'
before tidy?
-- 
http://www.geocities.com/jidanni/ Taiwan(04)25854780
Received on Monday, 21 January 2002 10:38:51 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:51 GMT