Problem using...

To html-tidy,

I am facing a problem using tidy.
I am trying to convert a html to xml. Tidy has been a great
help to me for this, but I am facing some problems in some cases.

The Tidy works fine when I save the html page from the browser
and the html is converted to xml witjout any errors with some
warnings.
However When I  download it using a program and then run tidy
it gives me errors and I cant convert html to xml.

This is sample code in java I use to download a html:

 InputStream in = null;
 OutputStream out = null;

 URL url = new URL("http://"+siteurl);   // Create the URL
 in = url.openStream();        // Open a stream to it
 out = new FileOutputStream(filename);

 // Now copy bytes from the URL to the output stream
  byte[] buffer = new byte[4096];
  int bytes_read = 0;
  while(true)
  {
        bytes_read = in.read(buffer);
        if(bytes_read == -1)
	break;
        out.write(buffer, 0, bytes_read);
  }


Can anyone help me in this regarding.

Regards,
Valmik Desai.

Received on Sunday, 20 April 2003 00:40:48 UTC