W3C home > Mailing lists > Public > html-tidy@w3.org > January to March 2001

Problems with entities HTML -> XML! (new to list)

From: Niels Peter Strandberg <nielspeter@npstrandberg.com>
Date: Mon, 29 Jan 2001 22:19:48 +0100
Message-Id: <200101292119.WAA01371@d1o16.telia.com>
To: html-tidy@w3c.org
Hi!

(Using jTidy)

Im converting a html file to xml.  I have 2 problems that I need to know how to solve.

Code:

        tidy.setXmlOut(true);
        tidy.setFixBackslash(true); // URL FixBackslash
        tidy.setRawOut(true); // RawOut - avoid mapping values > 127 to entities
        tidy.setXmlPi(true); // XmlPi - add <?xml?> for XML docs
        tidy.setQuoteAmpersand(true); // QuoteAmpersand - output naked ampersand as &
        tidy.setTidyMark(false); // TidyMark - add meta element indicating tidied doc
        tidy.setWraplen(99999); // Wraplen - default wrap margin



The result file output:

<?xml version="1.0"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<html>
<head>
<link rel="made" href="wsanchez@apple.com" />
<title>Welcome to Mac OS X!</title>
...........
Received on Monday, 29 January 2001 16:19:24 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:45 GMT