
Tidy and removing attributes

From: EL-HAID Nabil <elhaid@ecole.ensicaen.fr>
Date: Tue, 8 Nov 2005 15:42:55 +0200
To: html-tidy@w3.org <html-tidy@w3.org>
Message-Id: <20051108132908.M26046@ensicaen.ismra.fr>


I am a student at a French university, working on a project in
association with France Telecom R&D. The aim of the project is to extract
information from stock exchange web sites.
I have run into a problem during development: I don't know how to remove
all of the tags' attributes when I transform my input stream into an XML file
using Tidy. (I want to remove them because I work with an XSLT processor.)
For example,
the input stream looks like this:

<body bgcolor="FFFFFF" bgsound="toto.wav"> ..... </body>

and output.xml should look like this:

<body> ..... </body>

My question is: how can I use Tidy to achieve this?
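
For illustration only, the attribute removal described above can be sketched as a post-processing pass over XML that Tidy has already produced. This is an assumption about one possible workaround, not a Tidy option: the function name strip_attributes and the sample string are hypothetical, and the sketch uses Python's standard xml.etree.ElementTree module rather than Tidy itself.

```python
import xml.etree.ElementTree as ET


def strip_attributes(xml_text: str) -> str:
    """Return xml_text with every attribute removed from every element.

    Hypothetical helper: it expects well-formed XML, such as the
    output of a prior Tidy run, and does not call Tidy itself.
    """
    root = ET.fromstring(xml_text)
    # iter() walks the root element and all of its descendants.
    for element in root.iter():
        element.attrib.clear()
    # encoding="unicode" returns a str without an XML declaration.
    return ET.tostring(root, encoding="unicode")


# Sample input modelled on the example in this message.
sample = '<body bgcolor="FFFFFF" bgsound="toto.wav"><p align="center">text</p></body>'
print(strip_attributes(sample))
```

Note that ElementTree does not treat namespace declarations (xmlns) as ordinary attributes, so this sketch leaves them in place.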

Any help would be greatly appreciated.

Thank you.
Received on Wednesday, 9 November 2005 03:19:27 UTC
