- From: Jacob Nordfalk <nordfalk@mobilixnet.dk>
- Date: Fri, 09 Nov 2001 01:04:05 +0100
- To: Sami Lempinen <sami@lempinen.net>, html-tidy@w3.org
- Message-ID: <3BEB1D75.22CC1F20@mobilixnet.dk>
Jacob Nordfalk wrote:
> Sami Lempinen wrote:
>
> > You're right about the lack of documentation, and you're welcome to
> > write some ;)
>
> OK, I'll try to write some documentation.
Here is my proposal - a README file.
HOW TO USE JTIDY AS A PROGRAM
=============================
If you are not a developer, you only need the file Tidy.jar
(found in the subdirectory build/).
Start the program by typing: java -jar Tidy.jar <parameters>
For example, to tidy up a file, type: java -jar Tidy.jar file.html
To get help, type: java -jar Tidy.jar -h
COMMAND LINE PARAMETERS
=======================
Tidy: file1 file2 ...
Utility to clean up & pretty print html files
see http://www.w3.org/People/Raggett/tidy/
options for tidy released on 4th August 2000
Processing directives
--------------------
  -indent or -i   indent element content
  -omit or -o     omit optional endtags
  -wrap 72        wrap text at column 72 (default is 68)
  -upper or -u    force tags to upper case (default is lower)
  -clean or -c    replace font, nobr & center tags by CSS
  -numeric or -n  output numeric rather than named entities
  -errors or -e   only show errors
  -quiet or -q    suppress nonessential output
  -xml            use this when input is wellformed xml
  -asxml          to convert html to wellformed xml
  -slides         to burst into slides on h2 elements
Character encodings
------------------
  -raw      leave chars > 128 unchanged upon output
  -ascii    use ASCII for output, Latin-1 for input
  -latin1   use Latin-1 for both input and output
  -iso2022  use ISO2022 for both input and output
  -utf8     use UTF-8 for both input and output
  -mac      use the Apple MacRoman character set
File manipulation
---------------
  -config <file>  set options from config file
  -f <file>       write errors to named <file>
  -modify or -m   to modify original files
Miscellaneous
------------
  -version or -v  show version
  -help or -h     list command line options
You can also use --blah for any config file option blah
Input/Output default to stdin/stdout respectively
Single letter options apart from -f may be combined
as in: tidy -f errs.txt -imu foo.html
For further info on HTML see http://www.w3.org/MarkUp
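The command line described above can also be started from another Java
program. What follows is only a sketch: it assumes that a "java" executable
is on the PATH and that Tidy.jar lies in build/. The class name RunTidy and
the method buildCommand are invented for this illustration and are not part
of JTidy.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper (not part of JTidy): runs Tidy.jar as a child process. */
public class RunTidy {

  /** Builds the command: java -jar <jar> -f <errFile> <options> <htmlFile> */
  public static List<String> buildCommand(String jar, String errFile,
                                          String options, String htmlFile) {
    List<String> cmd = new ArrayList<String>();
    cmd.add("java");
    cmd.add("-jar");
    cmd.add(jar);
    cmd.add("-f");       // -f takes a file name, so it is given on its own
    cmd.add(errFile);
    cmd.add(options);    // combined single letter options, e.g. "-iu"
    cmd.add(htmlFile);
    return cmd;
  }

  public static void main(String[] args) throws IOException, InterruptedException {
    if (args.length != 1) {
      System.out.println("Usage: RunTidy file.html");
      return;
    }
    // -i indents and -u upper-cases the tags; the tidied page goes to stdout.
    // Adding m (as in -imu) would make Tidy modify the input file instead.
    List<String> cmd = buildCommand("build/Tidy.jar", "errs.txt", "-iu", args[0]);
    Process p = new ProcessBuilder(cmd).inheritIO().start();
    System.err.println("Tidy exited with code " + p.waitFor());
  }
}
```

Passing the arguments as a list rather than as one string means that file
names containing spaces need no extra quoting.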
HOW TO USE AS A BEAN
====================
JTidy can also be used as a JavaBean from within a program. Below
is a simple example program that shows how to fetch a
URL and run Tidy on its content.
/**
 * Example of how to use the Tidy bean
 * Courtesy Chris Raber. Modified by Jacob Nordfalk
 */
// core Java stuff.
import java.io.*;
import java.net.*;

// JTidy stuff.
import org.w3c.tidy.Tidy;

public class TidyURL {

  public static void main(String args[]) {
    if (args.length != 1) {
      System.out.println("Usage: TidyURL url");
      System.out.println("Full example: \n"
          + "java -cp .:Tidy.jar TidyURL http://www.esperanto.net");
      return;
    }

    String url = args[0];
    try {
      // Read the page from the URL; collect the tidied result in memory
      URL u = new URL(url);
      BufferedInputStream sourceIn = new BufferedInputStream(u.openStream());
      ByteArrayOutputStream tidyOutStream = new ByteArrayOutputStream();

      // Create the Tidy bean
      Tidy tidy = new Tidy();

      // Set bean properties
      tidy.setQuiet(false);
      tidy.setShowWarnings(true);
      tidy.setIndentContent(true);
      tidy.setSmartIndent(true);
      tidy.setIndentAttributes(false);
      tidy.setWraplen(1024);
      //tidy.setXHTML(true);
      //tidy.setXmlOut(true);

      // Report errors and warnings on System.out; autoflush is switched on
      // so that buffered messages are not lost when the program exits
      tidy.setErrout(new PrintWriter(System.out, true));

      // Parse the input and write the tidied page to tidyOutStream
      tidy.parse(sourceIn, tidyOutStream);
      sourceIn.close();

      System.out.println(tidyOutStream.toString());
    } catch (Exception ex) {
      ex.printStackTrace();
    }
  }
}
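
The example prints the result with tidyOutStream.toString(), which decodes
the bytes using the platform's default character set. That is harmless for
plain ASCII output, but if the bean is set up to write UTF-8 (the counterpart
of the -utf8 option listed earlier), it is probably safer to decode with the
same character set explicitly. A small sketch using only the standard
library; the class and method names are invented for this illustration and
are not part of JTidy.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper (not part of JTidy). */
public class DecodeOutput {

  /** Decodes the buffered bytes as UTF-8 instead of the platform default. */
  public static String asUtf8(ByteArrayOutputStream out) {
    return new String(out.toByteArray(), StandardCharsets.UTF_8);
  }

  public static void main(String[] args) {
    // Stand-in for the stream that tidy.parse(...) would have filled.
    ByteArrayOutputStream tidyOutStream = new ByteArrayOutputStream();
    byte[] page = "<p>Saluton, \u0109iuj!</p>".getBytes(StandardCharsets.UTF_8);
    tidyOutStream.write(page, 0, page.length);

    System.out.println(asUtf8(tidyOutStream));
  }
}
```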
--
Jacob Nordfalk
Attachments
- application/octet-stream attachment: README
Received on Thursday, 8 November 2001 19:04:07 UTC