
Re: Using htmltidy to parse: getting the "body" of a node

From: Jany Quintard <jany.quintard@free.fr>
Date: Thu, 2 Oct 2003 09:48:21 +0200
To: joe user <palehaole@yahoo.com>
Cc: html-tidy@w3.org
Message-ID: <20031002074821.GH27073@figue>

* joe user [Wed, 01/10/2003 at 12:45 -0700]
> 
> Hello Tidy people,
> 
> I am trying to use Tidy to do its magic on (possibly
> broken) html files, for input to other layers of
> processing in C.  I need to get access to the body of
> stuff.
> 
> For example, in this block:
> 
> <p>This is some text.</p>

I do not know whether that is easy to do with Tidy. Could you use another
tool, such as OpenJade or something else?

In DSSSL with (Open)Jade, you would write:

(element p
  (literal (data (current-node))))

to obtain the character data inside each p element.
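
Since the later processing is in C, here is a rough sketch of the same idea
in plain C. It is only an illustration, not Tidy's or Jade's API: the helper
name inner_text is made up, and it assumes the markup has already been
cleaned up (lowercase, balanced, non-nested tags, no '>' inside attribute
values).

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not part of Tidy or Jade): copies the text between
 * the first <name ...> start tag and the next </name> end tag into out.
 * Assumes cleaned-up input: lowercase, balanced, non-nested tags.
 * Returns the number of bytes copied, or -1 if the element is not found
 * or out is too small. */
long inner_text(const char *html, const char *name, char *out, size_t outsz)
{
    char open[64], close[64];
    size_t nlen;
    const char *p, *start = NULL, *end;
    size_t len;

    assert(html != NULL && name != NULL && out != NULL);

    nlen = strlen(name);
    if (nlen == 0 || nlen > sizeof open - 4)
        return -1;
    snprintf(open, sizeof open, "<%s", name);     /* e.g. "<p"   */
    snprintf(close, sizeof close, "</%s>", name); /* e.g. "</p>" */

    /* Find a start tag whose name is exactly `name`, so "<p" does not
     * match "<pre". */
    p = html;
    while ((p = strstr(p, open)) != NULL) {
        char c = p[nlen + 1];
        if (c == '>' || c == ' ' || c == '\t' || c == '\n' || c == '\r') {
            start = strchr(p, '>'); /* end of the start tag */
            break;
        }
        p += nlen + 1;
    }
    if (start == NULL)
        return -1;
    start++; /* first character after '>' */

    end = strstr(start, close);
    if (end == NULL)
        return -1;

    len = (size_t)(end - start);
    if (len + 1 > outsz)
        return -1;
    memcpy(out, start, len);
    out[len] = '\0';
    return (long)len;
}
```

Called with the example above, the name "p" and a 64-byte buffer, this
copies "This is some text." into the buffer. Real-world HTML needs a proper
parser; this only shows what "the stuff inside the element" means.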

Jany
Received on Thursday, 2 October 2003 03:48:29 UTC
