Good for you, David. One of those threads was mine. We've been working around it by preprocessing to replace nbsp and other important entities with a marker phrase ("nbspmarker" etc.), then post-processing to put them back in.

BTW, my humble thanks to the Tidy programmers. It is a great tool that has helped as part of conversions of HTML to XHTML and DITA XML for XML content management systems.

Richard

-----Original Message-----
From: html-tidy-request@w3.org [mailto:html-tidy-request@w3.org] On Behalf Of ml
Sent: 06 November 2007 13:18
To: Bjoern Hoehrmann
Cc: html-tidy@w3.org
Subject: Re: missing

With default encodings it will produce <meta...charset=us-ascii> and all non-ASCII chars will be converted to entities. Both are wrong. Why doesn't "quote-nbsp:true" take effect?

David

Bjoern Hoehrmann wrote:
> * ml wrote:
>> I have the following config:
>>
>> bare:false
>> indent:false
>> output-html:true
>> doctype:transitional
>> hide-comments:true
>> wrap:0
>> quote-nbsp:true
>> quote-marks:false
>> quote-ampersand:true
>> break-before-br:false
>> char-encoding:raw
>> input-encoding:raw
>> output-encoding:raw
>
>> I found a few threads discussing it but no solution. Tested versions
>> are Ubuntu package "HTML Tidy for Linux/x86 released on 1 September 2005"
>> and Pecl PHP Tidy 1.2.
>>
>> Any suggestion to prevent Tidy from removing " "? I need UTF-8
>> input/output.
>
> Then why do you specify 'raw'? Can you reproduce this problem with
> only the default options?

Received on Tuesday, 6 November 2007 21:15:42 GMT
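
For anyone wanting to try the marker workaround Richard describes, here is a minimal Python sketch. The function names, the list of protected entities, and the exact "<name>marker" spelling are assumptions made for illustration, not anything Tidy itself provides; the Tidy run in the middle is left as a placeholder comment.

```python
import re

# Assumed set of entities to protect; extend it to whatever your content needs.
PROTECTED = ("nbsp", "ensp", "emsp", "shy")
_NAMES = "|".join(re.escape(name) for name in PROTECTED)

def protect_entities(html):
    """Replace e.g. '&nbsp;' with the plain-text marker 'nbspmarker'."""
    return re.sub(r"&(" + _NAMES + r");", lambda m: m.group(1) + "marker", html)

def restore_entities(html):
    """Turn markers such as 'nbspmarker' back into '&nbsp;'."""
    return re.sub(r"(" + _NAMES + r")marker", lambda m: "&" + m.group(1) + ";", html)

source = "<p>a&nbsp;b &amp; c</p>"
protected = protect_entities(source)
# ... run Tidy on `protected` here (hypothetical step, not shown) ...
restored = restore_entities(protected)
```

The marker has to be a string that cannot otherwise occur in the document, so a longer or more unusual marker than the one assumed here may be safer for real content.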