- From: tOM Trottier <tOM@Abacurial.com>
- Date: Wed, 17 Apr 2013 02:51:54 -0400
- To: <html-tidy@w3.org>
- Message-ID: <516E468A.18553.20D97077@tOM.Abacurial.com>
2013 April 17 - Wednesday at 10:57 re:Re: help? …
Richard A. O'Keefe <ok@cs.otago.ac.nz>wrote …
>When Tidy says "This document has errors that must be fixed before
>using HTML Tidy to generate a tidied up version",
>
>- does it give you a location in the file?
>- does it say anything about what those errors _are_?
>
>General questions:
>
>- Take a typical file you want to clean up. How big is it?
>- How is it encoded (character set)? Did you tell Tidy what
> that encoding is?
>- How was the file generated?
>- Do you have access to a web browser that will tell you about
> syntax errors? (I have iCab, but that's Mac only, I believe.)
> The answer is YES YOU DO have access to such a web browser.
> The World Wide Web Consortium have a browser/editor called
> Amaya and you can download a Windows version from
> http://www.w3.org/Amaya/User/BinDist.html
> (or a Linux version or a Mac OS X version).
>- What happens if you submit your file to the W3C's validator
> at validator.w3c.org?
>
>If all else fails,
>
>- do you have a small example you could post?
For what it's worth, I use the PSPad editor (in Windows) which has integrated Tidy
and an IE display. Tidy errors and warnings appear in a window below the source.
You can also use different sets of parameters, and you can tell Tidy to tidy up the
errors as best it can. But I let it tell me what and where the errors are so I can fix
by hand.
These are my annotated sets of parameters to Tidy:
; quick reference: http://tidy.sourceforge.net/docs/quickref.html
; cesky preklad: http://www.pspad.com/cz/tidy.htm
[Tidy- tom no-CSS]
; note, only the pretty print is different between tom indent /
no-indent---------------------
;------------------xml,html, xhtml options--------------------------
add-xml-decl:yes;Example: y/n, yes/no, t/f, true/false, 1/0 ;This option
specifies if Tidy should add the XML declaration when outputting XML or
XHTML. Note that if the input already includes an <?xml ... ?> declaration then
this option will be ignored. If the encoding for the output is different from
"ascii", one of the utf encodings or "raw", the declaration is always added as
required by the XML standard. ;See also: char-encoding, output-encoding
add-xml-space:yes;This option specifies if Tidy should add
xml:space="preserve" to elements such as <PRE>, <STYLE> and <SCRIPT>
when generating XML. This is needed if the whitespace in such elements is to
be parsed appropriately without having access to the DTD.
alt-text:Image;Type: String ;Default: - Default: - ;This option specifies the
default "alt=" text Tidy uses for <IMG> attributes. This feature is dangerous
as it suppresses further accessibility warnings. You are responsible for making
your documents accessible to people who can not see the images!
anchor-as-name:yes;Type: Boolean Default: yes Example: y/n, yes/no, t/f,
true/false, 1/0 This option controls the deletion or addition of the name
attribute in elements where it can serve as anchor. If set to "yes", a name
attribute, if not already existing, is added along an existing id attribute if the
DTD allows it. If set to "no", any existing name attribute is removed if an id
attribute exists or has been added.
assume-xml-procins:no;Default: no;Example: y/n, yes/no, t/f, true/false, 1/0
;This option specifies if Tidy should change the parsing of processing
instructions to require ?> as the terminator rather than >. This option is
automatically set if the input is in XML.
bare:yes;This option specifies if Tidy should strip Microsoft specific HTML from
Word 2000 documents, and output spaces rather than non-breaking spaces
where they exist in the input.
clean:no;This option specifies if Tidy should strip out surplus presentational
tags and attributes replacing them by style rules and structural markup as
appropriate. It works well on the HTML saved by Microsoft Office products.
;See also: drop-font-tags
css-prefix:c;Type: String ;Default: c ;This option specifies the prefix that Tidy
uses for styles rules. By default, "c" will be used.
decorate-inferred-ul:no;This option specifies if Tidy should decorate inferred UL
elements with some CSS markup to avoid indentation to the right.
doctype:auto;Type: DocType ;Default: auto ;Example: omit, auto, strict,
transitional, user ;This option specifies the DOCTYPE declaration generated
by Tidy. If set to "omit" the output won’t contain a DOCTYPE declaration. If set
to "auto" (the default) Tidy will use an educated guess based upon the
contents of the document. If set to "strict", Tidy will set the DOCTYPE to the
strict DTD. If set to "loose", the DOCTYPE is set to the loose (transitional) DTD.
Alternatively, you can supply a string for the formal public identifier (FPI).
;For example:
;doctype:"-//ACME//DTD HTML 3.14159//EN" ;If you specify the FPI for an XHTML
document, Tidy will set the system identifier to an empty string. For an HTML
document, Tidy adds a system identifier only if one was already present in
order to preserve the processing mode of some browsers. Tidy leaves the
DOCTYPE for generic XML documents unchanged. --doctype omit implies
--numeric-entities yes. This option does not offer a validation of the document
conformance.
drop-empty-paras:yes;This option specifies if Tidy should discard empty
paragraphs.
drop-font-tags:no;Default: no;This option specifies if Tidy should discard
<FONT> and <CENTER> tags without creating the corresponding style rules.
This option can be set independently of the clean option. ;See also: clean
drop-proprietary-attributes:yes;Default: no ;This option specifies if Tidy
should strip out proprietary attributes, such as MS data binding attributes.
enclose-block-text:yes;Default: no ;This option specifies if Tidy should insert
a <P> element to enclose any text it finds in any element that allows mixed
content for HTML transitional but not HTML strict.
enclose-text:yes;Default: no ;This option specifies if Tidy should enclose any
text it finds in the body element within a <P> element. This is useful when you
want to take existing HTML and use it with a style sheet.
escape-cdata:no;Default: no ;This option specifies if Tidy should convert
<![CDATA[]]> sections to normal text.
fix-backslash:yes;Default: yes ;This option specifies if Tidy should replace
backslash characters "\" in URLs by forward slashes "/".
fix-bad-comments:yes;Default: yes ;This option specifies if Tidy should replace
unexpected hyphens with "=" characters when it comes across adjacent
hyphens. The default is yes. This option is provided for users of Cold Fusion
which uses the comment syntax: <!--- --->
fix-uri:yes;Default: yes ;This option specifies if Tidy should check attribute
values that carry URIs for illegal characters and if such are found, escape them
as HTML 4 recommends.
hide-comments:no;Default: no ;This option specifies if Tidy should print out
comments.
hide-endtags:no;Default: no ;This option specifies if Tidy should omit optional
end-tags when generating the pretty printed markup. This option is ignored if
you are outputting to XML.
indent-cdata:no;Default: no ;This option specifies if Tidy should indent
<![CDATA[]]> sections.
input-xml:no;Default: no ;This option specifies if Tidy should use the XML
parser rather than the error correcting HTML parser.
join-classes:no;Default: no ;This option specifies if Tidy should combine class
names to generate a single new class name, if multiple class assignments are
detected on an element. ;See also: join-styles, repeated-attributes
join-styles:yes;Default: yes ;This option specifies if Tidy should combine styles
to generate a single new style, if multiple style values are detected on an
element. ;See also: join-classes, repeated-attributes
literal-attributes:no;Default: no ;This option specifies if Tidy should ensure that
whitespace characters within attribute values are passed through unchanged.
logical-emphasis:yes;Default: no ;This option specifies if Tidy should replace
any occurrence of <I> by <EM> and any occurrence of <B> by <STRONG>.
In both cases, the attributes are preserved unchanged. This option can be set
independently of the clean and drop-font-tags options.
lower-literals:yes;Default: yes ;This option specifies if Tidy should convert the
value of an attribute that takes a list of predefined values to lower case. This is
required for XHTML documents.
merge-divs:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <div> such as
"<div><div>...</div></div>". If set to "auto", the attributes of the inner
<div> are moved to the outer one. As well, nested <div> with ID attributes
are not merged. If set to "yes", the attributes of the inner <div> are discarded
with the exception of "class" and "style". ;See also: clean, merge-spans
merge-spans:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <span> such as
"<span><span>...</span></span>". The algorithm is identical to the one
used by --merge-divs. ;See also: clean, merge-divs
ncr:yes;Default: yes ;This option specifies if Tidy should allow numeric
character references.
;new-blocklevel-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...
;This option specifies new block-level tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
Note you can’t change the content model for elements such as <TABLE>,
<UL>, <OL> and <DL>. This option is ignored in XML mode. ;See also:
new-empty-tags, new-inline-tags, new-pre-tags
;new-empty-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This
option specifies new empty inline tags. This option takes a space or comma
separated list of tag names. Unless you declare new tags, Tidy will refuse to
generate a tidied file if the input includes previously unknown tags. Remember
to also declare empty tags as either inline or blocklevel. This option is ignored
in XML mode. ;See also: new-blocklevel-tags, new-inline-tags, new-pre-tags
;new-inline-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...;This
option specifies new non-empty inline tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
This option is ignored in XML mode. ;See also: new-blocklevel-tags,
new-empty-tags, new-pre-tags
;new-pre-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This option
specifies new tags that are to be processed in exactly the same way as HTML’s
<PRE> element. This option takes a space or comma separated list of tag
names. Unless you declare new tags, Tidy will refuse to generate a tidied file if
the input includes previously unknown tags. Note you can not as yet add new
CDATA elements (similar to <SCRIPT>). This option is ignored in XML mode.
;See also: new-blocklevel-tags, new-empty-tags, new-inline-tags
;numeric-entities;Default: no ;This option specifies if Tidy should output
entities other than the built-in HTML entities (&, <, > and ") in
the numeric rather than the named entity form. Only entities compatible with
the DOCTYPE declaration generated are used. Entities that can be represented
in the output encoding are translated correspondingly. ;See also: doctype,
preserve-entities
output-html:no;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as HTML.
output-xhtml:yes;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as extensible HTML. This option causes Tidy to
set the DOCTYPE and default namespace as appropriate to XHTML. If a
DOCTYPE or namespace is given they will checked for consistency with the
content of the document. In the case of an inconsistency, the corrected values
will appear in the output. For XHTML, entities can be written as named or
numeric entities according to the setting of the "numeric-entities" option. The
original case of tags and attributes will be preserved, regardless of other
options.
output-xml:no;Default: no ;This option specifies if Tidy should pretty print
output, writing it as well-formed XML. Any entities not defined in XML 1.0 will
be written as numeric entities to allow them to be parsed by a XML parser. The
original case of tags and attributes will be preserved, regardless of other
options.
preserve-entities:yes;Default: no ;This option specifies if Tidy should preserve
the well-formed entitites as found in the input.
quote-ampersand:yes;Default: yes ;This option specifies if Tidy should output
unadorned & characters as &.
quote-marks:yes;Default: no ;This option specifies if Tidy should output "
characters as " as is preferred by some editing environments. The
apostrophe character ’ is written out as ' since many web browsers don’t
yet support '.
quote-nbsp:yes;Default: yes ;This option specifies if Tidy should output
non-breaking space characters as entities, rather than as the Unicode
character value 160 (decimal).
repeated-attributes:keep-last;Type: enum ;Default: keep-last ;Example:
keep-first, keep-last ;This option specifies if Tidy should keep the first or last
attribute, if an attribute is repeated, e.g. has two align attributes. ;See also:
join-classes, join-styles
replace-color:yes;Default: no ;This option specifies if Tidy should replace
numeric values in color attributes by HTML/XHTML color names where defined,
e.g. replace "#ffffff" with "white".
show-body-only:no;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should print only the contents of the
body tag as an HTML fragment. If set to "auto", this is performed only if the
body tag has been inferred. Useful for incorporating existing whole pages as a
portion of another page.
uppercase-attributes:no;Default: no ;This option specifies if Tidy should output
attribute names in upper case. The default is no, which results in lower case
attribute names, except for XML input, where the original case is preserved.
uppercase-tags:no;Default: no ;This option specifies if Tidy should output tag
names in upper case. The default is no, which results in lower case tag names,
except for XML input, where the original case is preserved.
word-2000:yes;Default: no ;This option specifies if Tidy should go to great
pains to strip out all the surplus stuff Microsoft Word 2000 inserts when you
save Word documents as "Web pages". Doesn’t handle embedded images or
VML. You should consider using Word’s "Save As: Web Page, Filtered".
;------Diagnosticsoptions:-------------------------------------
accessibility-check:3;Type: enum ;Default: 0 (Tidy Classic) ;Example: 0 (Tidy
Classic), 1 (Priority 1 Checks), 2 (Priority 2 Checks), 3 (Priority 3 Checks) ;This
option specifies what level of accessibility checking, if any, that Tidy should
do. Level 0 is equivalent to Tidy Classic’s accessibility checking. For more
information on Tidy’s accessibility checking, visit the Adaptive Technology
Resource Centre at the University of Toronto at
http://www.aprompt.ca/Tidy/accessibilitychecks.html.
show-errors:10 ;Type: Integer ;Default: 6 ;Example: 0, 1, 2, ... ;This option
specifies the number Tidy uses to determine if further errors should be shown.
If set to 0, then no errors are shown.
show-warnings:yes;Default: yes ;This option specifies if Tidy should suppress
warnings. This can be useful when a few errors are hidden in a flurry of
warnings.
;----------------------------PrettyPrint options:-----------------------------
break-before-br:no;Default: no ;This option specifies if Tidy should output a line
break before each <BR> element.
indent:no;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should indent block-level tags. If set
to "auto", this option causes Tidy to decide whether or not to indent the
content of tags such as TITLE, H1-H6, LI, TD, TD, or P depending on whether or
not the content includes a block-level element. You are advised to avoid
setting indent to yes as this can expose layout bugs in some browsers. ;See
also: indent-spaces
indent-attributes:no;Default: no ;This option specifies if Tidy should begin
each attribute on a new line.
indent-spaces:0;Type: Integer ;Default: 2 ;Example: 0, 1, 2, ... ;This option
specifies the number of spaces Tidy uses to indent content, when indentation
is enabled.= ;See also: indent
markup:yes;Default: yes ;This option specifies if Tidy should generate a pretty
printed version of the markup. Note that Tidy won’t generate a pretty printed
version if it finds significant errors (see force-output).
punctuation-wrap:no;Default: no ;This option specifies if Tidy should line wrap
after some Unicode or Chinese punctuation characters.
sort-attributes:alpha;Type: enum ;Default: none ;Example: none, alpha ;This
option specifies that tidy should sort attributes within an element using the
specified sort algorithm. If set to "alpha", the algorithm is an ascending
alphabetic sort.
;split;Default: no ;Currently not used. Tidy Classic only.
tab-size:2;Type: Integer ;Default: 8 ;Example: 0, 1, 2, ... ;This option
specifies the number of columns that Tidy uses between successive tab stops.
It is used to map tabs to spaces when reading the input. Tidy never outputs
tabs.
vertical-space:no;Default: no ;This option specifies if Tidy should add some
empty lines for readability.
wrap:208;Type: Integer ;Default: 68 ;Example: 0 (no wrapping), 1, 2, ...
;This option specifies the right margin Tidy uses for line wrapping. Tidy tries
to wrap lines so that they do not exceed this length. Set wrap to zero if you
want to disable line wrapping.
wrap-asp:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within ASP pseudo elements, which look like: <% ... %>.
wrap-attributes:no;Default: no ;This option specifies if Tidy should line wrap
attribute values, for easier editing. This option can be set independently of
wrap-script-literals. ;See also: wrap-script-literals
wrap-jste:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within JSTE pseudo elements, which look like: <# ... #>.
wrap-php:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within PHP pseudo elements, which look like: <?php ... ?>.
wrap-script-literals:no;Default: no ;This option specifies if Tidy should line wrap
string literals that appear in script attributes. Tidy wraps long script string
literals by inserting a backslash character before the line break. ;See also:
wrap-attributes
wrap-sections:no;Default: yes ;This option specifies if Tidy should line wrap
text contained within <![ ... ]> section tags.
;------------------------------------CharacterEncoding
options:--------------------------------------
ascii-chars:no;Default: no ;Can be used to modify behavior of -c (--clean yes)
option. If set to "yes" when using -c, &emdash;, ”, and other named
character entities are downgraded to their closest ascii equivalents. ;See
also: clean
char-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for both the
input and output. For ascii, Tidy will accept Latin-1 (ISO-8859-1) character
values, but will use entities for all characters whose value > 127. For raw, Tidy
will output values above 127 without translating them into entities. For latin1,
characters above 255 will be written as entities. For utf8, Tidy assumes that
both input and output is encoded as UTF-8. You can use iso2022 for files
encoded using the ISO-2022 family of encodings e.g. ISO-2022-JP. For mac and
win1252, Tidy will accept vendor specific character values, but will use entities
for all characters whose value > 127. ;See also: input-encoding,
output-encoding
input-encoding:utf8;Type: Encoding ;Default: latin1 ;Example: raw, ascii,
latin0, latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16,
big5, shiftjis ;This option specifies the character encoding Tidy uses for the
input. See char-encoding for more info. ;See also: char-encoding
language:en;Type: String ;Default: - ;Default: - ;Currently not used, but this
option specifies the language Tidy uses (for instance "en").
;newline;Type: enum ;Default: Platform dependent ;Example: LF, CRLF, CR
;The default is appropriate to the current platform: CRLF on PC-DOS,
MS-Windows and OS/2, CR on Classic Mac OS, and LF everywhere else (Unix
and Linux).
;output-bom;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should write a Unicode Byte Order
Mark character (BOM; also known as Zero Width No-Break Space; has value of
U+FEFF) to the beginning of the output; only for UTF-8 and UTF-16 output
encodings. If set to "auto", this option causes Tidy to write a BOM to the
output only if a BOM was present at the beginning of the input. A BOM is
always written for XML/XHTML output using UTF-16 output encodings.
output-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for the output.
See char-encoding for more info. May only be different from input-encoding for
Latin encodings (ascii, latin0, latin1, mac, win1252, ibm858). ;See also:
char-encoding
;----------------------Miscellaneousoptions:-------------------------------
;error-file;Type: String ;Default: - ;Default: - ;This option specifies the error file
Tidy uses for errors and warnings. Normally errors and warnings are output to
"stderr". ;See also: output-file
;force-output;Default: no ;This option specifies if Tidy should produce output
even if errors are encountered. Use this option with care - if Tidy reports an
error, this means Tidy was not able to, or is not sure how to, fix the error, so
the resulting output may not reflect your intention.
;gnu-emacs;Default: no ;This option specifies if Tidy should change the
format for reporting errors and warnings to a format that is more easily parsed
by GNU Emacs.
;gnu-emacs-file;Type: String ;Default: - Default: - Used internally.
keep-time:yes;Default: no ;This option specifies if Tidy should keep the
original modification time of files that Tidy modifies in place. The default is no.
Setting the option to yes allows you to tidy files without causing these files to
be uploaded to a web server when using a tool such as SiteCopy. Note this
feature is not supported on some platforms.
;output-file;Type: String ;Default: -;Default: - ; This option specifies the output
file Tidy uses for markup. Normally markup is written to "stdout". ;See also:
error-file
;quiet;Default: no ;This option specifies if Tidy should output the summary of
the numbers of errors and warnings, or the welcome or informational
messages.
;slide-style;Type: String ;Default: - ;Default: - ;Currently not used. Tidy Classic
only.
tidy-mark:yes;Default: yes ;This option specifies if Tidy should add a meta
element to the document head to indicate that the document has been tidied.
Tidy won’t add a meta element if one is already present.
write-back:yes;Default: no ;This option specifies if Tidy should write back the
tidied markup to the same file it read from. You are advised to keep copies of
important files before tidying them, as on rare occasions the result may not be
what you expect.
[Tidy- tom no-indent]
; note, only the pretty print is different between tom indent /
no-indent---------------------
;------------------xml,html, xhtml options--------------------------
add-xml-decl:yes;Example: y/n, yes/no, t/f, true/false, 1/0 ;This option
specifies if Tidy should add the XML declaration when outputting XML or
XHTML. Note that if the input already includes an <?xml ... ?> declaration then
this option will be ignored. If the encoding for the output is different from
"ascii", one of the utf encodings or "raw", the declaration is always added as
required by the XML standard. ;See also: char-encoding, output-encoding
add-xml-space:yes;This option specifies if Tidy should add
xml:space="preserve" to elements such as <PRE>, <STYLE> and <SCRIPT>
when generating XML. This is needed if the whitespace in such elements is to
be parsed appropriately without having access to the DTD.
alt-text:Image;Type: String ;Default: - Default: - ;This option specifies the
default "alt=" text Tidy uses for <IMG> attributes. This feature is dangerous
as it suppresses further accessibility warnings. You are responsible for making
your documents accessible to people who can not see the images!
anchor-as-name:yes;Type: Boolean Default: yes Example: y/n, yes/no, t/f,
true/false, 1/0 This option controls the deletion or addition of the name
attribute in elements where it can serve as anchor. If set to "yes", a name
attribute, if not already existing, is added along an existing id attribute if the
DTD allows it. If set to "no", any existing name attribute is removed if an id
attribute exists or has been added.
assume-xml-procins:no;Default: no;Example: y/n, yes/no, t/f, true/false, 1/0
;This option specifies if Tidy should change the parsing of processing
instructions to require ?> as the terminator rather than >. This option is
automatically set if the input is in XML.
bare:yes;This option specifies if Tidy should strip Microsoft specific HTML from
Word 2000 documents, and output spaces rather than non-breaking spaces
where they exist in the input.
clean:yes;This option specifies if Tidy should strip out surplus presentational
tags and attributes replacing them by style rules and structural markup as
appropriate. It works well on the HTML saved by Microsoft Office products.
;See also: drop-font-tags
css-prefix:c;Type: String ;Default: c ;This option specifies the prefix that Tidy
uses for styles rules. By default, "c" will be used.
decorate-inferred-ul:no;This option specifies if Tidy should decorate inferred UL
elements with some CSS markup to avoid indentation to the right.
doctype:auto;Type: DocType ;Default: auto ;Example: omit, auto, strict,
transitional, user ;This option specifies the DOCTYPE declaration generated
by Tidy. If set to "omit" the output won’t contain a DOCTYPE declaration. If set
to "auto" (the default) Tidy will use an educated guess based upon the
contents of the document. If set to "strict", Tidy will set the DOCTYPE to the
strict DTD. If set to "loose", the DOCTYPE is set to the loose (transitional) DTD.
Alternatively, you can supply a string for the formal public identifier (FPI).
;For example:
;doctype:"-//ACME//DTD HTML 3.14159//EN" ;If you specify the FPI for an XHTML
document, Tidy will set the system identifier to an empty string. For an HTML
document, Tidy adds a system identifier only if one was already present in
order to preserve the processing mode of some browsers. Tidy leaves the
DOCTYPE for generic XML documents unchanged. --doctype omit implies
--numeric-entities yes. This option does not offer a validation of the document
conformance.
drop-empty-paras:yes;This option specifies if Tidy should discard empty
paragraphs.
drop-font-tags:no;Default: no;This option specifies if Tidy should discard
<FONT> and <CENTER> tags without creating the corresponding style rules.
This option can be set independently of the clean option. ;See also: clean
drop-proprietary-attributes:yes;Default: no ;This option specifies if Tidy
should strip out proprietary attributes, such as MS data binding attributes.
enclose-block-text:yes;Default: no ;This option specifies if Tidy should insert
a <P> element to enclose any text it finds in any element that allows mixed
content for HTML transitional but not HTML strict.
enclose-text:yes;Default: no ;This option specifies if Tidy should enclose any
text it finds in the body element within a <P> element. This is useful when you
want to take existing HTML and use it with a style sheet.
escape-cdata:no;Default: no ;This option specifies if Tidy should convert
<![CDATA[]]> sections to normal text.
fix-backslash:yes;Default: yes ;This option specifies if Tidy should replace
backslash characters "\" in URLs by forward slashes "/".
fix-bad-comments:yes;Default: yes ;This option specifies if Tidy should replace
unexpected hyphens with "=" characters when it comes across adjacent
hyphens. The default is yes. This option is provided for users of Cold Fusion
which uses the comment syntax: <!--- --->
fix-uri:yes;Default: yes ;This option specifies if Tidy should check attribute
values that carry URIs for illegal characters and if such are found, escape them
as HTML 4 recommends.
hide-comments:no;Default: no ;This option specifies if Tidy should print out
comments.
hide-endtags:no;Default: no ;This option specifies if Tidy should omit optional
end-tags when generating the pretty printed markup. This option is ignored if
you are outputting to XML.
indent-cdata:no;Default: no ;This option specifies if Tidy should indent
<![CDATA[]]> sections.
input-xml:no;Default: no ;This option specifies if Tidy should use the XML
parser rather than the error correcting HTML parser.
join-classes:no;Default: no ;This option specifies if Tidy should combine class
names to generate a single new class name, if multiple class assignments are
detected on an element. ;See also: join-styles, repeated-attributes
join-styles:yes;Default: yes ;This option specifies if Tidy should combine styles
to generate a single new style, if multiple style values are detected on an
element. ;See also: join-classes, repeated-attributes
literal-attributes:no;Default: no ;This option specifies if Tidy should ensure that
whitespace characters within attribute values are passed through unchanged.
logical-emphasis:yes;Default: no ;This option specifies if Tidy should replace
any occurrence of <I> by <EM> and any occurrence of <B> by <STRONG>.
In both cases, the attributes are preserved unchanged. This option can be set
independently of the clean and drop-font-tags options.
lower-literals:yes;Default: yes ;This option specifies if Tidy should convert the
value of an attribute that takes a list of predefined values to lower case. This is
required for XHTML documents.
merge-divs:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <div> such as
"<div><div>...</div></div>". If set to "auto", the attributes of the inner
<div> are moved to the outer one. As well, nested <div> with ID attributes
are not merged. If set to "yes", the attributes of the inner <div> are discarded
with the exception of "class" and "style". ;See also: clean, merge-spans
merge-spans:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <span> such as
"<span><span>...</span></span>". The algorithm is identical to the one
used by --merge-divs. ;See also: clean, merge-divs
ncr:yes;Default: yes ;This option specifies if Tidy should allow numeric
character references.
;new-blocklevel-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...
;This option specifies new block-level tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
Note you can’t change the content model for elements such as <TABLE>,
<UL>, <OL> and <DL>. This option is ignored in XML mode. ;See also:
new-empty-tags, new-inline-tags, new-pre-tags
;new-empty-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This
option specifies new empty inline tags. This option takes a space or comma
separated list of tag names. Unless you declare new tags, Tidy will refuse to
generate a tidied file if the input includes previously unknown tags. Remember
to also declare empty tags as either inline or blocklevel. This option is ignored
in XML mode. ;See also: new-blocklevel-tags, new-inline-tags, new-pre-tags
;new-inline-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...;This
option specifies new non-empty inline tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
This option is ignored in XML mode. ;See also: new-blocklevel-tags,
new-empty-tags, new-pre-tags
;new-pre-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This option
specifies new tags that are to be processed in exactly the same way as HTML’s
<PRE> element. This option takes a space or comma separated list of tag
names. Unless you declare new tags, Tidy will refuse to generate a tidied file if
the input includes previously unknown tags. Note you can not as yet add new
CDATA elements (similar to <SCRIPT>). This option is ignored in XML mode.
;See also: new-blocklevel-tags, new-empty-tags, new-inline-tags
;numeric-entities;Default: no ;This option specifies if Tidy should output
entities other than the built-in HTML entities (&, <, > and ") in
the numeric rather than the named entity form. Only entities compatible with
the DOCTYPE declaration generated are used. Entities that can be represented
in the output encoding are translated correspondingly. ;See also: doctype,
preserve-entities
output-html:no;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as HTML.
output-xhtml:yes;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as extensible HTML. This option causes Tidy to
set the DOCTYPE and default namespace as appropriate to XHTML. If a
DOCTYPE or namespace is given they will checked for consistency with the
content of the document. In the case of an inconsistency, the corrected values
will appear in the output. For XHTML, entities can be written as named or
numeric entities according to the setting of the "numeric-entities" option. The
original case of tags and attributes will be preserved, regardless of other
options.
output-xml:no;Default: no ;This option specifies if Tidy should pretty print
output, writing it as well-formed XML. Any entities not defined in XML 1.0 will
be written as numeric entities to allow them to be parsed by a XML parser. The
original case of tags and attributes will be preserved, regardless of other
options.
preserve-entities:yes;Default: no ;This option specifies if Tidy should preserve
the well-formed entitites as found in the input.
quote-ampersand:yes;Default: yes ;This option specifies if Tidy should output
unadorned & characters as &.
quote-marks:yes;Default: no ;This option specifies if Tidy should output "
characters as " as is preferred by some editing environments. The
apostrophe character ’ is written out as ' since many web browsers don’t
yet support '.
quote-nbsp:yes;Default: yes ;This option specifies if Tidy should output
non-breaking space characters as entities, rather than as the Unicode
character value 160 (decimal).
repeated-attributes:keep-last;Type: enum ;Default: keep-last ;Example:
keep-first, keep-last ;This option specifies if Tidy should keep the first or last
attribute, if an attribute is repeated, e.g. has two align attributes. ;See also:
join-classes, join-styles
replace-color:yes;Default: no ;This option specifies if Tidy should replace
numeric values in color attributes by HTML/XHTML color names where defined,
e.g. replace "#ffffff" with "white".
show-body-only:no;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should print only the contents of the
body tag as an HTML fragment. If set to "auto", this is performed only if the
body tag has been inferred. Useful for incorporating existing whole pages as a
portion of another page.
uppercase-attributes:no;Default: no ;This option specifies if Tidy should output
attribute names in upper case. The default is no, which results in lower case
attribute names, except for XML input, where the original case is preserved.
uppercase-tags:no;Default: no ;This option specifies if Tidy should output tag
names in upper case. The default is no, which results in lower case tag names,
except for XML input, where the original case is preserved.
word-2000:yes;Default: no ;This option specifies if Tidy should go to great
pains to strip out all the surplus stuff Microsoft Word 2000 inserts when you
save Word documents as "Web pages". Doesn’t handle embedded images or
VML. You should consider using Word’s "Save As: Web Page, Filtered".
;------Diagnosticsoptions:-------------------------------------
accessibility-check:3;Type: enum ;Default: 0 (Tidy Classic) ;Example: 0 (Tidy
Classic), 1 (Priority 1 Checks), 2 (Priority 2 Checks), 3 (Priority 3 Checks) ;This
option specifies what level of accessibility checking, if any, that Tidy should
do. Level 0 is equivalent to Tidy Classic’s accessibility checking. For more
information on Tidy’s accessibility checking, visit the Adaptive Technology
Resource Centre at the University of Toronto at
http://www.aprompt.ca/Tidy/accessibilitychecks.html.
show-errors:10 ;Type: Integer ;Default: 6 ;Example: 0, 1, 2, ... ;This option
specifies the number Tidy uses to determine if further errors should be shown.
If set to 0, then no errors are shown.
show-warnings:yes;Default: yes ;This option specifies if Tidy should suppress
warnings. This can be useful when a few errors are hidden in a flurry of
warnings.
;----------------------------PrettyPrint options:-----------------------------
break-before-br:no;Default: no ;This option specifies if Tidy should output a line
break before each <BR> element.
indent:no;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should indent block-level tags. If set
to "auto", this option causes Tidy to decide whether or not to indent the
content of tags such as TITLE, H1-H6, LI, TD, TD, or P depending on whether or
not the content includes a block-level element. You are advised to avoid
setting indent to yes as this can expose layout bugs in some browsers. ;See
also: indent-spaces
indent-attributes:no;Default: no ;This option specifies if Tidy should begin
each attribute on a new line.
indent-spaces:0;Type: Integer ;Default: 2 ;Example: 0, 1, 2, ... ;This option
specifies the number of spaces Tidy uses to indent content, when indentation
is enabled.= ;See also: indent
markup:yes;Default: yes ;This option specifies if Tidy should generate a pretty
printed version of the markup. Note that Tidy won’t generate a pretty printed
version if it finds significant errors (see force-output).
punctuation-wrap:no;Default: no ;This option specifies if Tidy should line wrap
after some Unicode or Chinese punctuation characters.
sort-attributes:alpha;Type: enum ;Default: none ;Example: none, alpha ;This
option specifies that tidy should sort attributes within an element using the
specified sort algorithm. If set to "alpha", the algorithm is an ascending
alphabetic sort.
;split;Default: no ;Currently not used. Tidy Classic only.
tab-size:2;Type: Integer ;Default: 8 ;Example: 0, 1, 2, ... ;This option
specifies the number of columns that Tidy uses between successive tab stops.
It is used to map tabs to spaces when reading the input. Tidy never outputs
tabs.
vertical-space:no;Default: no ;This option specifies if Tidy should add some
empty lines for readability.
wrap:208;Type: Integer ;Default: 68 ;Example: 0 (no wrapping), 1, 2, ...
;This option specifies the right margin Tidy uses for line wrapping. Tidy tries
to wrap lines so that they do not exceed this length. Set wrap to zero if you
want to disable line wrapping.
wrap-asp:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within ASP pseudo elements, which look like: <% ... %>.
wrap-attributes:no;Default: no ;This option specifies if Tidy should line wrap
attribute values, for easier editing. This option can be set independently of
wrap-script-literals. ;See also: wrap-script-literals
wrap-jste:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within JSTE pseudo elements, which look like: <# ... #>.
wrap-php:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within PHP pseudo elements, which look like: <?php ... ?>.
wrap-script-literals:no;Default: no ;This option specifies if Tidy should line wrap
string literals that appear in script attributes. Tidy wraps long script string
literals by inserting a backslash character before the line break. ;See also:
wrap-attributes
wrap-sections:no;Default: yes ;This option specifies if Tidy should line wrap
text contained within <![ ... ]> section tags.
;------------------------------------CharacterEncoding
options:--------------------------------------
ascii-chars:no;Default: no ;Can be used to modify behavior of -c (--clean yes)
option. If set to "yes" when using -c, &emdash;, ”, and other named
character entities are downgraded to their closest ascii equivalents. ;See
also: clean
char-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for both the
input and output. For ascii, Tidy will accept Latin-1 (ISO-8859-1) character
values, but will use entities for all characters whose value > 127. For raw, Tidy
will output values above 127 without translating them into entities. For latin1,
characters above 255 will be written as entities. For utf8, Tidy assumes that
both input and output is encoded as UTF-8. You can use iso2022 for files
encoded using the ISO-2022 family of encodings e.g. ISO-2022-JP. For mac and
win1252, Tidy will accept vendor specific character values, but will use entities
for all characters whose value > 127. ;See also: input-encoding,
output-encoding
input-encoding:utf8;Type: Encoding ;Default: latin1 ;Example: raw, ascii,
latin0, latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16,
big5, shiftjis ;This option specifies the character encoding Tidy uses for the
input. See char-encoding for more info. ;See also: char-encoding
language:en;Type: String ;Default: - ;Default: - ;Currently not used, but this
option specifies the language Tidy uses (for instance "en").
;newline;Type: enum ;Default: Platform dependent ;Example: LF, CRLF, CR
;The default is appropriate to the current platform: CRLF on PC-DOS,
MS-Windows and OS/2, CR on Classic Mac OS, and LF everywhere else (Unix
and Linux).
;output-bom;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should write a Unicode Byte Order
Mark character (BOM; also known as Zero Width No-Break Space; has value of
U+FEFF) to the beginning of the output; only for UTF-8 and UTF-16 output
encodings. If set to "auto", this option causes Tidy to write a BOM to the
output only if a BOM was present at the beginning of the input. A BOM is
always written for XML/XHTML output using UTF-16 output encodings.
output-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for the output.
See char-encoding for more info. May only be different from input-encoding for
Latin encodings (ascii, latin0, latin1, mac, win1252, ibm858). ;See also:
char-encoding
;----------------------Miscellaneousoptions:-------------------------------
;error-file;Type: String ;Default: - ;Default: - ;This option specifies the error file
Tidy uses for errors and warnings. Normally errors and warnings are output to
"stderr". ;See also: output-file
;force-output;Default: no ;This option specifies if Tidy should produce output
even if errors are encountered. Use this option with care - if Tidy reports an
error, this means Tidy was not able to, or is not sure how to, fix the error, so
the resulting output may not reflect your intention.
;gnu-emacs;Default: no ;This option specifies if Tidy should change the
format for reporting errors and warnings to a format that is more easily parsed
by GNU Emacs.
;gnu-emacs-file;Type: String ;Default: - Default: - Used internally.
keep-time:yes;Default: no ;This option specifies if Tidy should keep the
original modification time of files that Tidy modifies in place. The default is no.
Setting the option to yes allows you to tidy files without causing these files to
be uploaded to a web server when using a tool such as SiteCopy. Note this
feature is not supported on some platforms.
;output-file;Type: String ;Default: -;Default: - ; This option specifies the output
file Tidy uses for markup. Normally markup is written to "stdout". ;See also:
error-file
;quiet;Default: no ;This option specifies if Tidy should output the summary of
the numbers of errors and warnings, or the welcome or informational
messages.
;slide-style;Type: String ;Default: - ;Default: - ;Currently not used. Tidy Classic
only.
tidy-mark:yes;Default: yes ;This option specifies if Tidy should add a meta
element to the document head to indicate that the document has been tidied.
Tidy won’t add a meta element if one is already present.
write-back:yes;Default: no ;This option specifies if Tidy should write back the
tidied markup to the same file it read from. You are advised to keep copies of
important files before tidying them, as on rare occasions the result may not be
what you expect.
[Tidy- tom indent]
;------------------xml,html, xhtml options--------------------------
add-xml-decl:yes;Example: y/n, yes/no, t/f, true/false, 1/0 ;This option
specifies if Tidy should add the XML declaration when outputting XML or
XHTML. Note that if the input already includes an <?xml ... ?> declaration then
this option will be ignored. If the encoding for the output is different from
"ascii", one of the utf encodings or "raw", the declaration is always added as
required by the XML standard. ;See also: char-encoding, output-encoding
add-xml-space:yes;This option specifies if Tidy should add
xml:space="preserve" to elements such as <PRE>, <STYLE> and <SCRIPT>
when generating XML. This is needed if the whitespace in such elements is to
be parsed appropriately without having access to the DTD.
alt-text:Image;Type: String ;Default: - Default: - ;This option specifies the
default "alt=" text Tidy uses for <IMG> attributes. This feature is dangerous
as it suppresses further accessibility warnings. You are responsible for making
your documents accessible to people who can not see the images!
anchor-as-name:yes;Type: Boolean Default: yes Example: y/n, yes/no, t/f,
true/false, 1/0 This option controls the deletion or addition of the name
attribute in elements where it can serve as anchor. If set to "yes", a name
attribute, if not already existing, is added along an existing id attribute if the
DTD allows it. If set to "no", any existing name attribute is removed if an id
attribute exists or has been added.
assume-xml-procins:no;Default: no;Example: y/n, yes/no, t/f, true/false, 1/0
;This option specifies if Tidy should change the parsing of processing
instructions to require ?> as the terminator rather than >. This option is
automatically set if the input is in XML.
bare:yes;This option specifies if Tidy should strip Microsoft specific HTML from
Word 2000 documents, and output spaces rather than non-breaking spaces
where they exist in the input.
clean:yes;This option specifies if Tidy should strip out surplus presentational
tags and attributes replacing them by style rules and structural markup as
appropriate. It works well on the HTML saved by Microsoft Office products.
;See also: drop-font-tags
css-prefix:c;Type: String ;Default: c ;This option specifies the prefix that Tidy
uses for styles rules. By default, "c" will be used.
decorate-inferred-ul:no;This option specifies if Tidy should decorate inferred UL
elements with some CSS markup to avoid indentation to the right.
doctype:auto;Type: DocType ;Default: auto ;Example: omit, auto, strict,
transitional, user ;This option specifies the DOCTYPE declaration generated
by Tidy. If set to "omit" the output won’t contain a DOCTYPE declaration. If set
to "auto" (the default) Tidy will use an educated guess based upon the
contents of the document. If set to "strict", Tidy will set the DOCTYPE to the
strict DTD. If set to "loose", the DOCTYPE is set to the loose (transitional) DTD.
Alternatively, you can supply a string for the formal public identifier (FPI).
;For example:
;doctype:"-//ACME//DTD HTML 3.14159//EN" ;If you specify the FPI for an XHTML
document, Tidy will set the system identifier to an empty string. For an HTML
document, Tidy adds a system identifier only if one was already present in
order to preserve the processing mode of some browsers. Tidy leaves the
DOCTYPE for generic XML documents unchanged. --doctype omit implies
--numeric-entities yes. This option does not offer a validation of the document
conformance.
drop-empty-paras:yes;This option specifies if Tidy should discard empty
paragraphs.
drop-font-tags:no;Default: no;This option specifies if Tidy should discard
<FONT> and <CENTER> tags without creating the corresponding style rules.
This option can be set independently of the clean option. ;See also: clean
drop-proprietary-attributes:yes;Default: no ;This option specifies if Tidy
should strip out proprietary attributes, such as MS data binding attributes.
enclose-block-text:yes;Default: no ;This option specifies if Tidy should insert
a <P> element to enclose any text it finds in any element that allows mixed
content for HTML transitional but not HTML strict.
enclose-text:yes;Default: no ;This option specifies if Tidy should enclose any
text it finds in the body element within a <P> element. This is useful when you
want to take existing HTML and use it with a style sheet.
escape-cdata:no;Default: no ;This option specifies if Tidy should convert
<![CDATA[]]> sections to normal text.
fix-backslash:yes;Default: yes ;This option specifies if Tidy should replace
backslash characters "\" in URLs by forward slashes "/".
fix-bad-comments:yes;Default: yes ;This option specifies if Tidy should replace
unexpected hyphens with "=" characters when it comes across adjacent
hyphens. The default is yes. This option is provided for users of Cold Fusion
which uses the comment syntax: <!--- --->
fix-uri:yes;Default: yes ;This option specifies if Tidy should check attribute
values that carry URIs for illegal characters and if such are found, escape them
as HTML 4 recommends.
hide-comments:no;Default: no ;This option specifies if Tidy should print out
comments.
hide-endtags:no;Default: no ;This option specifies if Tidy should omit optional
end-tags when generating the pretty printed markup. This option is ignored if
you are outputting to XML.
indent-cdata:no;Default: no ;This option specifies if Tidy should indent
<![CDATA[]]> sections.
input-xml:no;Default: no ;This option specifies if Tidy should use the XML
parser rather than the error correcting HTML parser.
join-classes:no;Default: no ;This option specifies if Tidy should combine class
names to generate a single new class name, if multiple class assignments are
detected on an element. ;See also: join-styles, repeated-attributes
join-styles:yes;Default: yes ;This option specifies if Tidy should combine styles
to generate a single new style, if multiple style values are detected on an
element. ;See also: join-classes, repeated-attributes
literal-attributes:no;Default: no ;This option specifies if Tidy should ensure that
whitespace characters within attribute values are passed through unchanged.
logical-emphasis:yes;Default: no ;This option specifies if Tidy should replace
any occurrence of <I> by <EM> and any occurrence of <B> by <STRONG>.
In both cases, the attributes are preserved unchanged. This option can be set
independently of the clean and drop-font-tags options.
lower-literals:yes;Default: yes ;This option specifies if Tidy should convert the
value of an attribute that takes a list of predefined values to lower case. This is
required for XHTML documents.
merge-divs:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <div> such as
"<div><div>...</div></div>". If set to "auto", the attributes of the inner
<div> are moved to the outer one. As well, nested <div> with ID attributes
are not merged. If set to "yes", the attributes of the inner <div> are discarded
with the exception of "class" and "style". ;See also: clean, merge-spans
merge-spans:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <span> such as
"<span><span>...</span></span>". The algorithm is identical to the one
used by --merge-divs. ;See also: clean, merge-divs
ncr:yes;Default: yes ;This option specifies if Tidy should allow numeric
character references.
;new-blocklevel-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...
;This option specifies new block-level tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
Note you can’t change the content model for elements such as <TABLE>,
<UL>, <OL> and <DL>. This option is ignored in XML mode. ;See also:
new-empty-tags, new-inline-tags, new-pre-tags
;new-empty-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This
option specifies new empty inline tags. This option takes a space or comma
separated list of tag names. Unless you declare new tags, Tidy will refuse to
generate a tidied file if the input includes previously unknown tags. Remember
to also declare empty tags as either inline or blocklevel. This option is ignored
in XML mode. ;See also: new-blocklevel-tags, new-inline-tags, new-pre-tags
;new-inline-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...;This
option specifies new non-empty inline tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
This option is ignored in XML mode. ;See also: new-blocklevel-tags,
new-empty-tags, new-pre-tags
;new-pre-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This option
specifies new tags that are to be processed in exactly the same way as HTML’s
<PRE> element. This option takes a space or comma separated list of tag
names. Unless you declare new tags, Tidy will refuse to generate a tidied file if
the input includes previously unknown tags. Note you can not as yet add new
CDATA elements (similar to <SCRIPT>). This option is ignored in XML mode.
;See also: new-blocklevel-tags, new-empty-tags, new-inline-tags
;numeric-entities;Default: no ;This option specifies if Tidy should output
entities other than the built-in HTML entities (&, <, > and ") in
the numeric rather than the named entity form. Only entities compatible with
the DOCTYPE declaration generated are used. Entities that can be represented
in the output encoding are translated correspondingly. ;See also: doctype,
preserve-entities
output-html:no;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as HTML.
output-xhtml:yes;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as extensible HTML. This option causes Tidy to
set the DOCTYPE and default namespace as appropriate to XHTML. If a
DOCTYPE or namespace is given they will checked for consistency with the
content of the document. In the case of an inconsistency, the corrected values
will appear in the output. For XHTML, entities can be written as named or
numeric entities according to the setting of the "numeric-entities" option. The
original case of tags and attributes will be preserved, regardless of other
options.
output-xml:no;Default: no ;This option specifies if Tidy should pretty print
output, writing it as well-formed XML. Any entities not defined in XML 1.0 will
be written as numeric entities to allow them to be parsed by a XML parser. The
original case of tags and attributes will be preserved, regardless of other
options.
preserve-entities:yes;Default: no ;This option specifies if Tidy should preserve
the well-formed entitites as found in the input.
quote-ampersand:yes;Default: yes ;This option specifies if Tidy should output
unadorned & characters as &.
quote-marks:yes;Default: no ;This option specifies if Tidy should output "
characters as " as is preferred by some editing environments. The
apostrophe character ’ is written out as ' since many web browsers don’t
yet support '.
quote-nbsp:yes;Default: yes ;This option specifies if Tidy should output
non-breaking space characters as entities, rather than as the Unicode
character value 160 (decimal).
repeated-attributes:keep-last;Type: enum ;Default: keep-last ;Example:
keep-first, keep-last ;This option specifies if Tidy should keep the first or last
attribute, if an attribute is repeated, e.g. has two align attributes. ;See also:
join-classes, join-styles
replace-color:yes;Default: no ;This option specifies if Tidy should replace
numeric values in color attributes by HTML/XHTML color names where defined,
e.g. replace "#ffffff" with "white".
show-body-only:no;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should print only the contents of the
body tag as an HTML fragment. If set to "auto", this is performed only if the
body tag has been inferred. Useful for incorporating existing whole pages as a
portion of another page.
uppercase-attributes:no;Default: no ;This option specifies if Tidy should output
attribute names in upper case. The default is no, which results in lower case
attribute names, except for XML input, where the original case is preserved.
uppercase-tags:no;Default: no ;This option specifies if Tidy should output tag
names in upper case. The default is no, which results in lower case tag names,
except for XML input, where the original case is preserved.
word-2000:yes;Default: no ;This option specifies if Tidy should go to great
pains to strip out all the surplus stuff Microsoft Word 2000 inserts when you
save Word documents as "Web pages". Doesn’t handle embedded images or
VML. You should consider using Word’s "Save As: Web Page, Filtered".
;------Diagnosticsoptions:-------------------------------------
accessibility-check:3;Type: enum ;Default: 0 (Tidy Classic) ;Example: 0 (Tidy
Classic), 1 (Priority 1 Checks), 2 (Priority 2 Checks), 3 (Priority 3 Checks) ;This
option specifies what level of accessibility checking, if any, that Tidy should
do. Level 0 is equivalent to Tidy Classic’s accessibility checking. For more
information on Tidy’s accessibility checking, visit the Adaptive Technology
Resource Centre at the University of Toronto at
http://www.aprompt.ca/Tidy/accessibilitychecks.html.
show-errors:10 ;Type: Integer ;Default: 6 ;Example: 0, 1, 2, ... ;This option
specifies the number Tidy uses to determine if further errors should be shown.
If set to 0, then no errors are shown.
show-warnings:yes;Default: yes ;This option specifies if Tidy should suppress
warnings. This can be useful when a few errors are hidden in a flurry of
warnings.
;----------------------------PrettyPrint options:-----------------------------
break-before-br:no;Default: no ;This option specifies if Tidy should output a line
break before each <BR> element.
indent:yes;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should indent block-level tags. If set
to "auto", this option causes Tidy to decide whether or not to indent the
content of tags such as TITLE, H1-H6, LI, TD, TD, or P depending on whether or
not the content includes a block-level element. You are advised to avoid
setting indent to yes as this can expose layout bugs in some browsers. ;See
also: indent-spaces
indent-attributes:yes;Default: no ;This option specifies if Tidy should begin
each attribute on a new line.
indent-spaces:2;Type: Integer ;Default: 2 ;Example: 0, 1, 2, ... ;This option
specifies the number of spaces Tidy uses to indent content, when indentation
is enabled.= ;See also: indent
markup:yes;Default: yes ;This option specifies if Tidy should generate a pretty
printed version of the markup. Note that Tidy won’t generate a pretty printed
version if it finds significant errors (see force-output).
punctuation-wrap:yes;Default: no ;This option specifies if Tidy should line wrap
after some Unicode or Chinese punctuation characters.
sort-attributes:alpha;Type: enum ;Default: none ;Example: none, alpha ;This
option specifies that tidy should sort attributes within an element using the
specified sort algorithm. If set to "alpha", the algorithm is an ascending
alphabetic sort.
;split;Default: no ;Currently not used. Tidy Classic only.
tab-size:2;Type: Integer ;Default: 8 ;Example: 0, 1, 2, ... ;This option
specifies the number of columns that Tidy uses between successive tab stops.
It is used to map tabs to spaces when reading the input. Tidy never outputs
tabs.
vertical-space:no;Default: no ;This option specifies if Tidy should add some
empty lines for readability.
wrap:208;Type: Integer ;Default: 68 ;Example: 0 (no wrapping), 1, 2, ...
;This option specifies the right margin Tidy uses for line wrapping. Tidy tries
to wrap lines so that they do not exceed this length. Set wrap to zero if you
want to disable line wrapping.
wrap-asp:yes;Default: yes ;This option specifies if Tidy should line wrap text
contained within ASP pseudo elements, which look like: <% ... %>.
wrap-attributes:yes;Default: no ;This option specifies if Tidy should line wrap
attribute values, for easier editing. This option can be set independently of
wrap-script-literals. ;See also: wrap-script-literals
wrap-jste:yes;Default: yes ;This option specifies if Tidy should line wrap text
contained within JSTE pseudo elements, which look like: <# ... #>.
wrap-php:yes;Default: yes ;This option specifies if Tidy should line wrap text
contained within PHP pseudo elements, which look like: <?php ... ?>.
wrap-script-literals:yes;Default: no ;This option specifies if Tidy should line
wrap string literals that appear in script attributes. Tidy wraps long script
string literals by inserting a backslash character before the line break. ;See
also: wrap-attributes
wrap-sections:yes;Default: yes ;This option specifies if Tidy should line wrap
text contained within <![ ... ]> section tags.
;------------------------------------CharacterEncoding
options:--------------------------------------
ascii-chars:no;Default: no ;Can be used to modify behavior of -c (--clean yes)
option. If set to "yes" when using -c, &emdash;, ”, and other named
character entities are downgraded to their closest ascii equivalents. ;See
also: clean
char-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for both the
input and output. For ascii, Tidy will accept Latin-1 (ISO-8859-1) character
values, but will use entities for all characters whose value > 127. For raw, Tidy
will output values above 127 without translating them into entities. For latin1,
characters above 255 will be written as entities. For utf8, Tidy assumes that
both input and output is encoded as UTF-8. You can use iso2022 for files
encoded using the ISO-2022 family of encodings e.g. ISO-2022-JP. For mac and
win1252, Tidy will accept vendor specific character values, but will use entities
for all characters whose value > 127. ;See also: input-encoding,
output-encoding
input-encoding:utf8;Type: Encoding ;Default: latin1 ;Example: raw, ascii,
latin0, latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16,
big5, shiftjis ;This option specifies the character encoding Tidy uses for the
input. See char-encoding for more info. ;See also: char-encoding
language:en;Type: String ;Default: - ;Default: - ;Currently not used, but this
option specifies the language Tidy uses (for instance "en").
;newline;Type: enum ;Default: Platform dependent ;Example: LF, CRLF, CR
;The default is appropriate to the current platform: CRLF on PC-DOS,
MS-Windows and OS/2, CR on Classic Mac OS, and LF everywhere else (Unix
and Linux).
;output-bom;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should write a Unicode Byte Order
Mark character (BOM; also known as Zero Width No-Break Space; has value of
U+FEFF) to the beginning of the output; only for UTF-8 and UTF-16 output
encodings. If set to "auto", this option causes Tidy to write a BOM to the
output only if a BOM was present at the beginning of the input. A BOM is
always written for XML/XHTML output using UTF-16 output encodings.
output-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for the output.
See char-encoding for more info. May only be different from input-encoding for
Latin encodings (ascii, latin0, latin1, mac, win1252, ibm858). ;See also:
char-encoding
;----------------------Miscellaneousoptions:-------------------------------
;error-file;Type: String ;Default: - ;Default: - ;This option specifies the error file
Tidy uses for errors and warnings. Normally errors and warnings are output to
"stderr". ;See also: output-file
;force-output;Default: no ;This option specifies if Tidy should produce output
even if errors are encountered. Use this option with care - if Tidy reports an
error, this means Tidy was not able to, or is not sure how to, fix the error, so
the resulting output may not reflect your intention.
;gnu-emacs;Default: no ;This option specifies if Tidy should change the
format for reporting errors and warnings to a format that is more easily parsed
by GNU Emacs.
;gnu-emacs-file;Type: String ;Default: - Default: - Used internally.
keep-time:yes;Default: no ;This option specifies if Tidy should keep the
original modification time of files that Tidy modifies in place. The default is no.
Setting the option to yes allows you to tidy files without causing these files to
be uploaded to a web server when using a tool such as SiteCopy. Note this
feature is not supported on some platforms.
;output-file;Type: String ;Default: -;Default: - ; This option specifies the output
file Tidy uses for markup. Normally markup is written to "stdout". ;See also:
error-file
;quiet;Default: no ;This option specifies if Tidy should output the summary of
the numbers of errors and warnings, or the welcome or informational
messages.
;slide-style;Type: String ;Default: - ;Default: - ;Currently not used. Tidy Classic
only.
tidy-mark:yes;Default: yes ;This option specifies if Tidy should add a meta
element to the document head to indicate that the document has been tidied.
Tidy won’t add a meta element if one is already present.
write-back:yes;Default: no ;This option specifies if Tidy should write back the
tidied markup to the same file it read from. You are advised to keep copies of
important files before tidying them, as on rare occasions the result may not be
what you expect.
[TiDyclean Document - wrap]
bare:no
clean:no
fix-bad-comments:yes
fix-backslash:yes
indent:yes
indent-attributes:yes
indent-spaces:2
char-encoding:ascii
quote-ampersand:yes
quote-nbsp:yes
replace-color:yes
word-2000:yes
wrap-attributes:yes
wrap-asp:yes
wrap-jste:yes
wrap-php:yes
write-back:yes
wrap:208
break-before-br:yes
[TiDyclean Document - nowrap]
bare:no
char-encoding:ascii
clean:no
fix-bad-comments:yes
fix-backslash:yes
indent:yes
indent-attributes:no
indent-spaces:2
quote-ampersand:yes
quote-nbsp:yes
replace-color:yes
wrap-asp:no
wrap-jste:no
wrap-php:no
write-back:yes
wrap:208
break-before-br:yes
[TiDyclean Document - nowrap, no indent]
bare:no
clean:no
fix-bad-comments:yes
fix-backslash:yes
indent:no
indent-attributes:no
indent-spaces:2
char-encoding:ascii
quote-ampersand:yes
quote-nbsp:no
word-2000:yes
replace-color:yes
wrap-asp:no
wrap-jste:no
wrap-php:no
write-back:yes
wrap:208
break-before-br:yes
[TiDyclean Microsoft Word 2000 document]
doctype:auto
drop-empty-paras:yes
fix-bad-comments:yes
fix-uri:yes
join-styles:yes
lower-literals:yes
ncr:yes
quote-ampersand:yes
quote-nbsp:yes
word-2000:yes
markup:yes
wrap-jste:yes
wrap-php:yes
wrap-section:yes
write-back:yes
wrap:208
break-before-br:yes
[TiDyclean Document - quote]
quote-ampersand:yes
quote-marks:yes
quote-nbsp:yes
write-back:yes
wrap:208
break-before-br:yes
[TiDyupgrade to CSS]
add-xml-decl:no
add-xml-pi:no
alt-text:Image
clean:yes
doctype:auto
drop-empty-paras:no
drop-font-tags:yes
fix-backslash:yes
fix-bad-comments:yes
hide-endtags:no
char-encoding:ascii
indent:yes
indent-attributes:yes
indent-spaces:2
input-xml:no
markup:yes
quiet:yes
quote-ampersand:yes
quote-nbsp:yes
tidy-mark:no
uppercase-attributes:no
uppercase-tags:no
word-2000:yes
wrap-asp:yes
wrap-attributes:yes
wrap-jste:yes
wrap-php:yes
write-back:yes
wrap:208
break-before-br:yes
[TiDyconvert to XML]
add-xml-decl:yes
add-xml-pi:yes
alt-text:Image
clean:yes
doctype:AUTO
drop-empty-paras:no
drop-font-tags:yes
fix-bad-comments:yes
hide-endtags:no
char-encoding:ascii
indent:yes
indent-spaces:2
input-xml:no
markup:yes
output-xml:yes
quiet:yes
quote-ampersand:yes
quote-nbsp:yes
tidy-mark:no
uppercase-attributes:no
uppercase-tags:no
word-2000:yes
wrap-asp:yes
wrap-attributes:yes
wrap-jste:yes
wrap-php:yes
write-back:yes
wrap:208
break-before-br:yes
[Tidyconvert to XHTML]
accessibility-check:3
add-xml-decl:yes
add-xml-space:no
add-xml-pi:no
alt-text:Image
doctype:auto
drop-empty-paras:yes
drop-font-tags:yes
drop-proprietary-attributes:yes
error-file:error.log
fix-backslash:yes
fix-bad-comments:yes
hide-endtags:no
char-encoding:ascii
indent:yes
indent-spaces:2
indent-attributes:yes
input-xml:no
literal-attributes:yes
logical-emphasis:yes
lower-literals:yes
markup:yes
output-xml:no
punctuation-wrap:yes
quiet:yes
replace-color:yes
sort-attributes:yes
tidy-mark:yes
uppercase-attributes:no
uppercase-tags:no
word-2000:no
wrap-asp:yes
wrap-attributes:yes
wrap-jste:yes
wrap-php:yes
write-back:yes
wrap:208
break-before-br:yes
clean:yes
output-xhtml:yes
[Tidywrap 4095]
; note, only the pretty print is different between tom indent /
no-indent---------------------
;------------------xml,html, xhtml options--------------------------
add-xml-decl:yes;Example: y/n, yes/no, t/f, true/false, 1/0 ;This option
specifies if Tidy should add the XML declaration when outputting XML or
XHTML. Note that if the input already includes an <?xml ... ?> declaration then
this option will be ignored. If the encoding for the output is different from
"ascii", one of the utf encodings or "raw", the declaration is always added as
required by the XML standard. ;See also: char-encoding, output-encoding
add-xml-space:yes;This option specifies if Tidy should add
xml:space="preserve" to elements such as <PRE>, <STYLE> and <SCRIPT>
when generating XML. This is needed if the whitespace in such elements is to
be parsed appropriately without having access to the DTD.
alt-text:Image;Type: String ;Default: - Default: - ;This option specifies the
default "alt=" text Tidy uses for <IMG> attributes. This feature is dangerous
as it suppresses further accessibility warnings. You are responsible for making
your documents accessible to people who can not see the images!
anchor-as-name:yes;Type: Boolean Default: yes Example: y/n, yes/no, t/f,
true/false, 1/0 This option controls the deletion or addition of the name
attribute in elements where it can serve as anchor. If set to "yes", a name
attribute, if not already existing, is added along an existing id attribute if the
DTD allows it. If set to "no", any existing name attribute is removed if an id
attribute exists or has been added.
assume-xml-procins:no;Default: no;Example: y/n, yes/no, t/f, true/false, 1/0
;This option specifies if Tidy should change the parsing of processing
instructions to require ?> as the terminator rather than >. This option is
automatically set if the input is in XML.
bare:yes;This option specifies if Tidy should strip Microsoft specific HTML from
Word 2000 documents, and output spaces rather than non-breaking spaces
where they exist in the input.
clean:no;This option specifies if Tidy should strip out surplus presentational
tags and attributes replacing them by style rules and structural markup as
appropriate. It works well on the HTML saved by Microsoft Office products.
;See also: drop-font-tags
css-prefix:c;Type: String ;Default: c ;This option specifies the prefix that Tidy
uses for styles rules. By default, "c" will be used.
decorate-inferred-ul:no;This option specifies if Tidy should decorate inferred UL
elements with some CSS markup to avoid indentation to the right.
doctype:auto;Type: DocType ;Default: auto ;Example: omit, auto, strict,
transitional, user ;This option specifies the DOCTYPE declaration generated
by Tidy. If set to "omit" the output won’t contain a DOCTYPE declaration. If set
to "auto" (the default) Tidy will use an educated guess based upon the
contents of the document. If set to "strict", Tidy will set the DOCTYPE to the
strict DTD. If set to "loose", the DOCTYPE is set to the loose (transitional) DTD.
Alternatively, you can supply a string for the formal public identifier (FPI).
;For example:
;doctype:"-//ACME//DTD HTML 3.14159//EN" ;If you specify the FPI for an XHTML
document, Tidy will set the system identifier to an empty string. For an HTML
document, Tidy adds a system identifier only if one was already present in
order to preserve the processing mode of some browsers. Tidy leaves the
DOCTYPE for generic XML documents unchanged. --doctype omit implies
--numeric-entities yes. This option does not offer a validation of the document
conformance.
drop-empty-paras:yes;This option specifies if Tidy should discard empty
paragraphs.
drop-font-tags:no;Default: no;This option specifies if Tidy should discard
<FONT> and <CENTER> tags without creating the corresponding style rules.
This option can be set independently of the clean option. ;See also: clean
drop-proprietary-attributes:yes;Default: no ;This option specifies if Tidy
should strip out proprietary attributes, such as MS data binding attributes.
enclose-block-text:yes;Default: no ;This option specifies if Tidy should insert
a <P> element to enclose any text it finds in any element that allows mixed
content for HTML transitional but not HTML strict.
enclose-text:yes;Default: no ;This option specifies if Tidy should enclose any
text it finds in the body element within a <P> element. This is useful when you
want to take existing HTML and use it with a style sheet.
escape-cdata:no;Default: no ;This option specifies if Tidy should convert
<![CDATA[]]> sections to normal text.
fix-backslash:yes;Default: yes ;This option specifies if Tidy should replace
backslash characters "\" in URLs by forward slashes "/".
fix-bad-comments:yes;Default: yes ;This option specifies if Tidy should replace
unexpected hyphens with "=" characters when it comes across adjacent
hyphens. The default is yes. This option is provided for users of Cold Fusion
which uses the comment syntax: <!--- --->
fix-uri:yes;Default: yes ;This option specifies if Tidy should check attribute
values that carry URIs for illegal characters and if such are found, escape them
as HTML 4 recommends.
hide-comments:no;Default: no ;This option specifies if Tidy should print out
comments.
hide-endtags:no;Default: no ;This option specifies if Tidy should omit optional
end-tags when generating the pretty printed markup. This option is ignored if
you are outputting to XML.
indent-cdata:no;Default: no ;This option specifies if Tidy should indent
<![CDATA[]]> sections.
input-xml:no;Default: no ;This option specifies if Tidy should use the XML
parser rather than the error correcting HTML parser.
join-classes:no;Default: no ;This option specifies if Tidy should combine class
names to generate a single new class name, if multiple class assignments are
detected on an element. ;See also: join-styles, repeated-attributes
join-styles:yes;Default: yes ;This option specifies if Tidy should combine styles
to generate a single new style, if multiple style values are detected on an
element. ;See also: join-classes, repeated-attributes
literal-attributes:no;Default: no ;This option specifies if Tidy should ensure that
whitespace characters within attribute values are passed through unchanged.
logical-emphasis:yes;Default: no ;This option specifies if Tidy should replace
any occurrence of <I> by <EM> and any occurrence of <B> by <STRONG>.
In both cases, the attributes are preserved unchanged. This option can be set
independently of the clean and drop-font-tags options.
lower-literals:yes;Default: yes ;This option specifies if Tidy should convert the
value of an attribute that takes a list of predefined values to lower case. This is
required for XHTML documents.
merge-divs:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <div> such as
"<div><div>...</div></div>". If set to "auto", the attributes of the inner
<div> are moved to the outer one. As well, nested <div> with ID attributes
are not merged. If set to "yes", the attributes of the inner <div> are discarded
with the exception of "class" and "style". ;See also: clean, merge-spans
merge-spans:auto;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;Can be used to modify behavior of -c (--clean yes) option. This
option specifies if Tidy should merge nested <span> such as
"<span><span>...</span></span>". The algorithm is identical to the one
used by --merge-divs. ;See also: clean, merge-divs
ncr:yes;Default: yes ;This option specifies if Tidy should allow numeric
character references.
;new-blocklevel-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...
;This option specifies new block-level tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
Note you can’t change the content model for elements such as <TABLE>,
<UL>, <OL> and <DL>. This option is ignored in XML mode. ;See also:
new-empty-tags, new-inline-tags, new-pre-tags
;new-empty-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This
option specifies new empty inline tags. This option takes a space or comma
separated list of tag names. Unless you declare new tags, Tidy will refuse to
generate a tidied file if the input includes previously unknown tags. Remember
to also declare empty tags as either inline or blocklevel. This option is ignored
in XML mode. ;See also: new-blocklevel-tags, new-inline-tags, new-pre-tags
;new-inline-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ...;This
option specifies new non-empty inline tags. This option takes a space or
comma separated list of tag names. Unless you declare new tags, Tidy will
refuse to generate a tidied file if the input includes previously unknown tags.
This option is ignored in XML mode. ;See also: new-blocklevel-tags,
new-empty-tags, new-pre-tags
;new-pre-tags;Type: Tag names ;Default: - ;Example: tagX, tagY, ... ;This option
specifies new tags that are to be processed in exactly the same way as HTML’s
<PRE> element. This option takes a space or comma separated list of tag
names. Unless you declare new tags, Tidy will refuse to generate a tidied file if
the input includes previously unknown tags. Note you can not as yet add new
CDATA elements (similar to <SCRIPT>). This option is ignored in XML mode.
;See also: new-blocklevel-tags, new-empty-tags, new-inline-tags
;numeric-entities;Default: no ;This option specifies if Tidy should output
entities other than the built-in HTML entities (&, <, > and ") in
the numeric rather than the named entity form. Only entities compatible with
the DOCTYPE declaration generated are used. Entities that can be represented
in the output encoding are translated correspondingly. ;See also: doctype,
preserve-entities
output-html:no;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as HTML.
output-xhtml:yes;Default: no ;This option specifies if Tidy should generate
pretty printed output, writing it as extensible HTML. This option causes Tidy to
set the DOCTYPE and default namespace as appropriate to XHTML. If a
DOCTYPE or namespace is given they will checked for consistency with the
content of the document. In the case of an inconsistency, the corrected values
will appear in the output. For XHTML, entities can be written as named or
numeric entities according to the setting of the "numeric-entities" option. The
original case of tags and attributes will be preserved, regardless of other
options.
output-xml:no;Default: no ;This option specifies if Tidy should pretty print
output, writing it as well-formed XML. Any entities not defined in XML 1.0 will
be written as numeric entities to allow them to be parsed by a XML parser. The
original case of tags and attributes will be preserved, regardless of other
options.
preserve-entities:yes;Default: no ;This option specifies if Tidy should preserve
the well-formed entitites as found in the input.
quote-ampersand:yes;Default: yes ;This option specifies if Tidy should output
unadorned & characters as &.
quote-marks:yes;Default: no ;This option specifies if Tidy should output "
characters as " as is preferred by some editing environments. The
apostrophe character ’ is written out as ' since many web browsers don’t
yet support '.
quote-nbsp:yes;Default: yes ;This option specifies if Tidy should output
non-breaking space characters as entities, rather than as the Unicode
character value 160 (decimal).
repeated-attributes:keep-last;Type: enum ;Default: keep-last ;Example:
keep-first, keep-last ;This option specifies if Tidy should keep the first or last
attribute, if an attribute is repeated, e.g. has two align attributes. ;See also:
join-classes, join-styles
replace-color:yes;Default: no ;This option specifies if Tidy should replace
numeric values in color attributes by HTML/XHTML color names where defined,
e.g. replace "#ffffff" with "white".
show-body-only:no;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should print only the contents of the
body tag as an HTML fragment. If set to "auto", this is performed only if the
body tag has been inferred. Useful for incorporating existing whole pages as a
portion of another page.
uppercase-attributes:no;Default: no ;This option specifies if Tidy should output
attribute names in upper case. The default is no, which results in lower case
attribute names, except for XML input, where the original case is preserved.
uppercase-tags:no;Default: no ;This option specifies if Tidy should output tag
names in upper case. The default is no, which results in lower case tag names,
except for XML input, where the original case is preserved.
word-2000:yes;Default: no ;This option specifies if Tidy should go to great
pains to strip out all the surplus stuff Microsoft Word 2000 inserts when you
save Word documents as "Web pages". Doesn’t handle embedded images or
VML. You should consider using Word’s "Save As: Web Page, Filtered".
;------Diagnosticsoptions:-------------------------------------
accessibility-check:3;Type: enum ;Default: 0 (Tidy Classic) ;Example: 0 (Tidy
Classic), 1 (Priority 1 Checks), 2 (Priority 2 Checks), 3 (Priority 3 Checks) ;This
option specifies what level of accessibility checking, if any, that Tidy should
do. Level 0 is equivalent to Tidy Classic’s accessibility checking. For more
information on Tidy’s accessibility checking, visit the Adaptive Technology
Resource Centre at the University of Toronto at
http://www.aprompt.ca/Tidy/accessibilitychecks.html.
show-errors:10 ;Type: Integer ;Default: 6 ;Example: 0, 1, 2, ... ;This option
specifies the number Tidy uses to determine if further errors should be shown.
If set to 0, then no errors are shown.
show-warnings:yes;Default: yes ;This option specifies if Tidy should suppress
warnings. This can be useful when a few errors are hidden in a flurry of
warnings.
;----------------------------PrettyPrint options:-----------------------------
break-before-br:no;Default: no ;This option specifies if Tidy should output a line
break before each <BR> element.
indent:no;Type: AutoBool ;Default: no ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should indent block-level tags. If set
to "auto", this option causes Tidy to decide whether or not to indent the
content of tags such as TITLE, H1-H6, LI, TD, TD, or P depending on whether or
not the content includes a block-level element. You are advised to avoid
setting indent to yes as this can expose layout bugs in some browsers. ;See
also: indent-spaces
indent-attributes:no;Default: no ;This option specifies if Tidy should begin
each attribute on a new line.
indent-spaces:0;Type: Integer ;Default: 2 ;Example: 0, 1, 2, ... ;This option
specifies the number of spaces Tidy uses to indent content, when indentation
is enabled.= ;See also: indent
markup:yes;Default: yes ;This option specifies if Tidy should generate a pretty
printed version of the markup. Note that Tidy won’t generate a pretty printed
version if it finds significant errors (see force-output).
punctuation-wrap:no;Default: no ;This option specifies if Tidy should line wrap
after some Unicode or Chinese punctuation characters.
sort-attributes:alpha;Type: enum ;Default: none ;Example: none, alpha ;This
option specifies that tidy should sort attributes within an element using the
specified sort algorithm. If set to "alpha", the algorithm is an ascending
alphabetic sort.
;split;Default: no ;Currently not used. Tidy Classic only.
tab-size:2;Type: Integer ;Default: 8 ;Example: 0, 1, 2, ... ;This option
specifies the number of columns that Tidy uses between successive tab stops.
It is used to map tabs to spaces when reading the input. Tidy never outputs
tabs.
vertical-space:no;Default: no ;This option specifies if Tidy should add some
empty lines for readability.
wrap:4095;Type: Integer ;Default: 68 ;Example: 0 (no wrapping), 1, 2, ...
;This option specifies the right margin Tidy uses for line wrapping. Tidy tries
to wrap lines so that they do not exceed this length. Set wrap to zero if you
want to disable line wrapping.
wrap-asp:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within ASP pseudo elements, which look like: <% ... %>.
wrap-attributes:no;Default: no ;This option specifies if Tidy should line wrap
attribute values, for easier editing. This option can be set independently of
wrap-script-literals. ;See also: wrap-script-literals
wrap-jste:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within JSTE pseudo elements, which look like: <# ... #>.
wrap-php:no;Default: yes ;This option specifies if Tidy should line wrap text
contained within PHP pseudo elements, which look like: <?php ... ?>.
wrap-script-literals:no;Default: no ;This option specifies if Tidy should line wrap
string literals that appear in script attributes. Tidy wraps long script string
literals by inserting a backslash character before the line break. ;See also:
wrap-attributes
wrap-sections:no;Default: yes ;This option specifies if Tidy should line wrap
text contained within <![ ... ]> section tags.
;------------------------------------CharacterEncoding
options:--------------------------------------
ascii-chars:no;Default: no ;Can be used to modify behavior of -c (--clean yes)
option. If set to "yes" when using -c, &emdash;, ”, and other named
character entities are downgraded to their closest ascii equivalents. ;See
also: clean
char-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for both the
input and output. For ascii, Tidy will accept Latin-1 (ISO-8859-1) character
values, but will use entities for all characters whose value > 127. For raw, Tidy
will output values above 127 without translating them into entities. For latin1,
characters above 255 will be written as entities. For utf8, Tidy assumes that
both input and output is encoded as UTF-8. You can use iso2022 for files
encoded using the ISO-2022 family of encodings e.g. ISO-2022-JP. For mac and
win1252, Tidy will accept vendor specific character values, but will use entities
for all characters whose value > 127. ;See also: input-encoding,
output-encoding
input-encoding:utf8;Type: Encoding ;Default: latin1 ;Example: raw, ascii,
latin0, latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16,
big5, shiftjis ;This option specifies the character encoding Tidy uses for the
input. See char-encoding for more info. ;See also: char-encoding
language:en;Type: String ;Default: - ;Default: - ;Currently not used, but this
option specifies the language Tidy uses (for instance "en").
;newline;Type: enum ;Default: Platform dependent ;Example: LF, CRLF, CR
;The default is appropriate to the current platform: CRLF on PC-DOS,
MS-Windows and OS/2, CR on Classic Mac OS, and LF everywhere else (Unix
and Linux).
;output-bom;Type: AutoBool ;Default: auto ;Example: auto, y/n, yes/no, t/f,
true/false, 1/0 ;This option specifies if Tidy should write a Unicode Byte Order
Mark character (BOM; also known as Zero Width No-Break Space; has value of
U+FEFF) to the beginning of the output; only for UTF-8 and UTF-16 output
encodings. If set to "auto", this option causes Tidy to write a BOM to the
output only if a BOM was present at the beginning of the input. A BOM is
always written for XML/XHTML output using UTF-16 output encodings.
output-encoding:ascii;Type: Encoding ;Default: ascii ;Example: raw, ascii, latin0,
latin1, utf8, iso2022, mac, win1252, ibm858, utf16le, utf16be, utf16, big5,
shiftjis ;This option specifies the character encoding Tidy uses for the output.
See char-encoding for more info. May only be different from input-encoding for
Latin encodings (ascii, latin0, latin1, mac, win1252, ibm858). ;See also:
char-encoding
;----------------------Miscellaneousoptions:-------------------------------
;error-file;Type: String ;Default: - ;Default: - ;This option specifies the error file
Tidy uses for errors and warnings. Normally errors and warnings are output to
"stderr". ;See also: output-file
;force-output;Default: no ;This option specifies if Tidy should produce output
even if errors are encountered. Use this option with care - if Tidy reports an
error, this means Tidy was not able to, or is not sure how to, fix the error, so
the resulting output may not reflect your intention.
;gnu-emacs;Default: no ;This option specifies if Tidy should change the
format for reporting errors and warnings to a format that is more easily parsed
by GNU Emacs.
;gnu-emacs-file;Type: String ;Default: - Default: - Used internally.
keep-time:yes;Default: no ;This option specifies if Tidy should keep the
original modification time of files that Tidy modifies in place. The default is no.
Setting the option to yes allows you to tidy files without causing these files to
be uploaded to a web server when using a tool such as SiteCopy. Note this
feature is not supported on some platforms.
;output-file;Type: String ;Default: -;Default: - ; This option specifies the output
file Tidy uses for markup. Normally markup is written to "stdout". ;See also:
error-file
;quiet;Default: no ;This option specifies if Tidy should output the summary of
the numbers of errors and warnings, or the welcome or informational
messages.
;slide-style;Type: String ;Default: - ;Default: - ;Currently not used. Tidy Classic
only.
tidy-mark:yes;Default: yes ;This option specifies if Tidy should add a meta
element to the document head to indicate that the document has been tidied.
Tidy won’t add a meta element if one is already present.
write-back:yes;Default: no ;This option specifies if Tidy should write back the
tidied markup to the same file it read from. You are advised to keep copies of
important files before tidying them, as on rare occasions the result may not be
what you expect.
Peace, tOM Trottier
-- Absum! -- Quidquid latine dictum sit altum viditur.
tOM Trottier +1 613 860-6633
tOM@Abacurial.com Skype:Abacurial
46D Chesterton Dr, Ottawa, ON K2E 5S9, Canada
http://Information.Architecture.Abacurial.com
P Est-ce c'est necessaire d'imprimer ce courriel ?
Do you really need to print this email?
PUBLIC NOTICE: Any use of this message, in any
manner whatsoever, will increase the amount of
disorder in the universe. Although no liability is implied
herein, the consumer is warned that this process will
ultimately lead to the heat death of the universe.
The following section of this message contains a file attachment
prepared for transmission using the Internet MIME message format.
If you are using Pegasus Mail, or any other MIME-compliant system,
you should be able to save it or view it from within your mailer.
If you cannot, please ask your system administrator for assistance.
---- File information -----------
File: IMG_0152-tom-200-10.jpg
Date: 10 Feb 2013, 3:37
Size: 10056 bytes.
Type: Unknown
Attachments
- Application/Octet-stream attachment: IMG_0152-tom-200-10.jpg
Received on Thursday, 18 April 2013 08:35:06 UTC