- From: <Charles.VILLEPREUX@oecd.org>
- Date: Mon, 3 Sep 2007 18:49:31 +0200
- To: <html-tidy@w3.org>
- Cc: <Pascale.CISSOKHO-MUTTER@oecd.org>, <Marion.DESMARTIN@oecd.org>, <Joseph.CROWTHER@oecd.org>
- Message-ID: <AB93230B9C912540A4CCE30839864B2607C439B6@EXCHANGEB.main.oecd.org>
Hi,

Using "HTML Tidy", I would like an explanation of the following:

1°) Why does "HTML Tidy" not handle the <sub> element properly within the <title> element?

HTML:
    <title>SourceOECD: Factbook 2007 - Emissions of carbon dioxide (CO<sub>2</sub>)</title>

XHTML:
    <title>SourceOECD: Factbook 2007 - Emissions of carbon dioxide (CO</title>
    </head>
    <body>
    <sub>2</sub>)

2°) Why does "HTML Tidy" not convert the • or • character?

My configuration for the encoding process:
    Input-encoding:ascii ; Output-encoding:utf8

When I open the XHTML page, the bullet does not appear.

Thank you very much for your help.

Best regards,

> -----Original Message-----
> From: VILLEPREUX Charles, PAC/PS
> Sent: 03 August, 2007 10:08 AM
> To: 'html-tidy@w3.org'
> Cc: CISSOKHO MUTTER Pascale, PAC/PM; DESMARTIN Marion, PAC/PROD
> Subject: Questions
>
> Hi,
>
> I am currently working at the OECD in Paris, and for my current project
> I need to convert HTML files to XHTML files.
> I found on the Internet that "HTML Tidy" is a good tool for this. It was
> originally written by Dave Raggett of the World Wide Web Consortium (W3C).
> The software is now maintained by a group of volunteers working as an
> Open Source Community at Source Forge.
>
> So I downloaded the EXE version from 24 Jul 2007:
> http://www.paehl.com/open_source/?HTML_Tidy_for_Windows
> I created a configuration file.
> Command line: tidy -config config.txt index_HTML
>
> The log file "err.txt" only reports warnings.
> But using the http://validator.w3.org/ tool, I discovered some errors
> such as:
>
> 1°) required attribute "alt" not specified for the "img" element
>
> Q1: Why does "HTML Tidy" not automatically create required attributes?
>
> 2°) required attribute "action" not specified for the "form" element
>
> HTML:
>     <form name=switchit><input type=hidden value=0 name=switchedselector></form>
>
> XHTML:
>     <form name="switchit" id="switchit"><input type="hidden" value="0" name="switchedselector" /></form>
>
> Q2: Why does HTML Tidy create the attribute id="switchit"?
>
> 3°) ID X already defined
>
> HTML:
>     <br />
>     <br><br></p><p></p>
>
>     </span></td>
>     </tr>
>
> XHTML:
>     <br />
>     <br />
>     <br />
>     <br />
>     <br /></span>
>     <p><span id="06" style="display: none;"></span></p>
>     </td>
>     </tr>
>
> Q3: Why does HTML Tidy create a span element with an id (06) which
> already exists?
>
> 4°) value of attribute "id" invalid: "0" cannot start a name (for
> example, id and name attributes must begin with a letter, not a digit)
>
> HTML:
>     <img src="plusminusimages/01plus.gif" border="0" id="01_image" onclick="javascript:changePlusMinus('01');" style="cursor:pointer;cursor:hand" name="01_image" />
>     ...
>     <span id="01" style="display: none ...
>
> Q4: Do you think this is more of a warning than an error? (It does not
> seem to cause any problem when browsing the XHTML file ...)
>
> 5°) there is no attribute "width" for the "div" element
>
> HTML:
>     <div id="showhideText" width="100" class="normal">Show all indicators</div>
>
> XHTML:
>     <div id="showhideText" width="100" class="normal">Show all indicators</div>
>
> Q5: Why does "HTML Tidy" keep the "width" attribute? Why not delete it?
>
> Afterwards, I compared the HTML and the XHTML in the Internet Explorer
> browser.
> I do not understand the following transformations done by "HTML Tidy":
>
> I) - A link does not work anymore.
>
> HTML:
>     <a href="javascript:openAll('01','02','03','04','05','06','07','08','09','10','11','12')" class="Text"><div id="showhideText" width="100" class="normal">Show all indicators</div></a>
>
> XHTML:
>     <a href="javascript:openAll('01','02','03','04','05','06','07','08','09','10','11','12')" class="Text"></a>
>     <div id="showhideText" width="100" class="normal">Show all indicators</div>
>
> Q6: Why does "HTML Tidy" change the nesting of tags?
>
> II) - More line breaks.
>
> HTML:
>     • <a class='Text' href='01-02-02.htm'>Elderly population by region</a><br />
>     <br><br></p>
>
>     </span></td>
>     </tr>
>
> XHTML:
>     • <a class='Text' href='01-02-02.htm'>Elderly population by region</a><br />
>     <br />
>     <br />
>     <br />
>     <br /></span></td>
>     </tr>
>
> Q7: Why does "HTML Tidy" generate additional <br /> elements?
>
> See the attached HTML, XHTML, log and configuration files:
>
> << File: index_HTML.htm >> << File: index_XHTML.htm >>
> << File: err.txt >> << File: config.txt >>
>
> Thank you very much for your help. Maybe some parameters need to be
> added to the configuration file.
>
> Best regards,
>
> Charles VILLEPREUX
> Technical Assistant
> R&D
> OECD, PAC
> Tel: +33 (0)1 49 10 43 66
> charles.villepreux@oecd.org
>
> HTML Tidy is a tool that was originally written by Dave Raggett of the
> World Wide Web Consortium (W3C). It is designed to fix mistakes in HTML,
> tidy up the layout (hence the name), assist with web accessibility,
> convert HTML to XHTML and many other things.
> The software is now maintained by a group of volunteers working as an
> Open Source Community at Source Forge and this is the place to go for
> more information.
> HTML Tidy Documentation:
> http://tidy.sourceforge.net/
Received on Monday, 3 September 2007 16:49:49 UTC