Re: api for tidy

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Mon, 16 Feb 2004 16:49:55 +0100
To: <520087207007-0001@T-Online.de>
Cc: <html-tidy@w3.org>
Message-ID: <404fe3b0.338955241@smtp.bjoern.hoehrmann.de>

* Semir Dzankovic wrote:
>My name is Semir Dzankovic and I am a student from Germany.
>I am looking for detailed information on how html-tidy works.
>Does Tidy use the DOM API or the SAX API to generate XHTML
>from HTML? On the Tidy website (tidy.sourceforge.net) I did
>not find the concrete details, which may be due to my bad
>English.

Neither. Tidy has an internal pull API that tokenizes the input stream
(start tags, end tags, processing instructions, ...) and builds a tree
from these tokens. The actual conversion from HTML to XHTML just adds
some attributes (xml:lang, xmlns) and changes the document type
declaration (<!DOCTYPE ...); both are tree modifications. Finally, an
XHTML pretty printer takes care of emitting proper XHTML syntax.
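
To make the four stages concrete, here is a rough sketch of the same
pipeline shape in Python. It is not Tidy's code or API: every name in it
(tokens, build_tree, to_xhtml, pretty, Node) is made up for illustration,
the tokenizer is a naive regex that ignores comments and processing
instructions, and the XHTML 1.0 Strict doctype is simply an assumed
target.

```python
import re
from dataclasses import dataclass, field

# Hypothetical names throughout; this only mirrors the pipeline shape.
TOKEN_RE = re.compile(r"<!DOCTYPE[^>]*>|</?[A-Za-z][^>]*>|[^<]+", re.I)
ATTR_RE = re.compile(r'([\w:-]+)(?:\s*=\s*("[^"]*"|\'[^\']*\'|[^\s>]+))?')
EMPTY = {"br", "hr", "img", "meta", "link", "input"}
# Assumed target doctype; a real tool would choose one from the content.
XHTML_DOCTYPE = ('<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" '
                 '"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">')

@dataclass
class Node:
    name: str  # element name, or "#root", "#doctype", "#text"
    attrs: dict = field(default_factory=dict)
    text: str = ""
    children: list = field(default_factory=list)

def tokens(html):
    """Stage 1, pull tokenizer: the consumer asks for one token at a time."""
    for match in TOKEN_RE.finditer(html):
        raw = match.group(0)
        if raw.startswith("<!"):
            yield ("doctype", raw, {})
        elif raw.startswith("</"):
            yield ("end", raw[2:-1].strip().lower(), {})
        elif raw.startswith("<"):
            parts = raw[1:-1].strip().rstrip("/").split(None, 1)
            rest = parts[1] if len(parts) > 1 else ""
            # Valueless attributes (e.g. "checked") get their own name as value.
            attrs = {k.lower(): (v or k).strip("\"'")
                     for k, v in ATTR_RE.findall(rest)}
            yield ("start", parts[0].lower(), attrs)
        else:
            yield ("text", raw, {})

def build_tree(token_stream):
    """Stage 2: build a tree, closing unclosed elements implicitly."""
    root = Node("#root")
    stack = [root]
    for kind, value, attrs in token_stream:
        if kind == "doctype":
            root.children.append(Node("#doctype", text=value))
        elif kind == "text" and value.strip():
            stack[-1].children.append(Node("#text", text=value))
        elif kind == "start":
            node = Node(value, attrs)
            stack[-1].children.append(node)
            if value not in EMPTY:
                stack.append(node)
        elif kind == "end":
            # Pop back to the nearest matching open element, if any.
            for i in range(len(stack) - 1, 0, -1):
                if stack[i].name == value:
                    del stack[i:]
                    break
    return root

def to_xhtml(root, lang="en"):
    """Stage 3, tree modifications: swap the doctype, add xmlns and xml:lang."""
    root.children = [c for c in root.children if c.name != "#doctype"]
    root.children.insert(0, Node("#doctype", text=XHTML_DOCTYPE))
    for child in root.children:
        if child.name == "html":
            child.attrs.setdefault("xmlns", "http://www.w3.org/1999/xhtml")
            child.attrs.setdefault("xml:lang", child.attrs.get("lang", lang))
    return root

def pretty(node, depth=0):
    """Stage 4, pretty printer: lowercase names, quoted attributes, <br />."""
    pad = "  " * depth
    if node.name == "#root":
        return "".join(pretty(c, 0) for c in node.children)
    if node.name == "#doctype":
        return node.text + "\n"
    if node.name == "#text":
        return pad + node.text.strip() + "\n"
    attrs = "".join(' {}="{}"'.format(k, v.replace('"', "&quot;"))
                    for k, v in node.attrs.items())
    if node.name in EMPTY:
        return f"{pad}<{node.name}{attrs} />\n"
    inner = "".join(pretty(c, depth + 1) for c in node.children)
    return f"{pad}<{node.name}{attrs}>\n{inner}{pad}</{node.name}>\n"

print(pretty(to_xhtml(build_tree(tokens(
    "<HTML lang=de><BODY><P>Hallo<BR>Welt</BODY></HTML>")))))
```

The point of the sketch is only that XHTML-specific work is confined to
stages 3 and 4: the tokenizer and tree builder do not know about XHTML,
and neither a DOM nor SAX interface is involved anywhere.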

If you are looking into using Tidy from other applications: the public
API is DOM-like but not really the W3C DOM, and it is very limited. You
cannot change most parts of the document, and you cannot access the
content of text and similar nodes.

Received on Monday, 16 February 2004 10:50:01 UTC