numeric entities dont survive -raw :-(

good thing I can send thru gmane.org, else:

>>>>> "M" == Mail Delivery Subsystem <MAILER-DAEMON@msr.hinet.net> writes:

M> The original message was received at Sat, 12 Oct 2002 17:07:19 +0800 (CST)
M> from 61-227-44-150.HINET-IP.hinet.net [61.227.44.150]

M>    ----- The following addresses had permanent fatal errors -----
M> <html-tidy@w3.org>

M>    ----- Transcript of session follows -----
M> ... while talking to w3.org.:
>>>> MAIL From:<jidanni@dman.ddts.net>
M> <<< 550 Access denied
M> 554 <html-tidy@w3.org>... Service unavailable
M> Reporting-MTA: dns; msr.hinet.net
M> Received-From-MTA: DNS; 61-227-44-150.HINET-IP.hinet.net
M> Arrival-Date: Sat, 12 Oct 2002 17:07:19 +0800 (CST)

M> Final-Recipient: RFC822; html-tidy@w3.org
M> Action: failed
M> Status: 5.0.0
M> Remote-MTA: DNS; w3.org
M> Diagnostic-Code: SMTP; 550 Access denied
M> Last-Attempt-Date: Sat, 12 Oct 2002 17:07:29 +0800 (CST)
M> From: Dan Jacobson <jidanni@dman.ddts.net>
M> Subject: Re: specifying numeric entities converted to hex vs. decimal
M> To: Bjoern Hoehrmann <derhoermi@gmx.net>
M> Cc: html-tidy@w3.org
M> X-Sent: 59 minutes, 40 seconds ago

>>>>> "B" == Bjoern Hoehrmann <derhoermi@gmx.net> writes:

B> * Dan Jacobson wrote:
>>> Tidy should have a way of specifying if one wants all  numeric
>>> entities converted to hex vs. decimal.  E.g. Bob wants all mine hex,
>>> Bill wants all decimal.

B> Why do you want hexadecimal character references? Browser support is
B> worse than for decimal character references. Hex char references may

M> for instance, i would like tidy to change any item over x0FF to hex
M> references --- easier for me to think about.  Then enable tidy to make
M> another copy for my website, with them all changes into decimal, to
M> work in more browsers.

B> save some bytes but if you care about that, you should probably use a
B> character encoding that does not require to use character references
B> at all. Hex char references may be easier to read for some people but
B> then there is still the browser support issue. It's easy to implement,
B> but I fear we would just be adding even more complexity to Tidy's
B> configuration options without real value for Tidy users.

M> OK, then at least allow my numeric entities to emerge unscathed if I
M> use -raw [which I must for big5 Chinese pages.]
M> Can you believe I must do
M> tidy file|sed 's/@\(x\?[0-9a-fA-F]\+\)@/\&#\1;/g;\
M>  s/<!-- Pre tidy source file -->/<!-- Dan: DO NOT EDIT this post-processed file -->/'
M> to keep tidy's hands off of them,
M> E.g. http://jidanni.org/lang/pinyin/19970607tai_ke.html
M> -- 
M> http://jidanni.org/ Taiwan(04)25854780

Received on Saturday, 12 October 2002 05:14:18 UTC