- From: Bjoern Hoehrmann <derhoermi@gmx.net>
- Date: Mon, 16 Feb 2004 16:49:55 +0100
- To: <520087207007-0001@T-Online.de>
- Cc: <html-tidy@w3.org>
* Semir Dzankovic wrote: >my name is semir dzankovic and I am a student from Germany. >I lam looking for detailed informations to function of html-tidy. >uses tidy the dom-api or the sax-api for the generation of xhtml >from html? on the website (tidy.sourceforge.net) of tidy I did >not find the concrete details, which can be the effect of my bad >english. Neither. Tidy has an internal pull API to tokenize an input stream into tokens (start tags, end tags, processing instructions, ...) and builds a tree from these tokens. The actual conversion from HTML to XHTML involves just adding some attributes (xml:lang, xmlns) and changing the document type declaration (<!DOCTYPE ...), these are tree modifications. Finally there is a XHTML pretty printer that takes care of using proper XHTML syntax. If you are looking into using Tidy from other applications, the public API is dom-like but not really W3C DOM and also very very limited, you cannot change most of parts of the document and you cannot access the content of text and similar nodes.
Received on Monday, 16 February 2004 10:50:01 UTC