W3C home > Mailing lists > Public > html-tidy@w3.org > July to September 2000

Tidy: report of 2 problems

From: <jpo@lsd.di.uminho.pt>
Date: Sun, 27 Aug 2000 14:33:46 -0400 (EDT)
Message-ID: <20000827183340.4858.qmail@homer.lsd.di.uminho.pt>
To: html-tidy@w3.org
Hi,

It is just to report two small problems I detected while using the Tidy
utility (version of 2000-08-04).

1 - hexadecimal escape sequences
################################

Tidy doesn't recognize hexadecimal escape sequences with an
uppercase 'x' (format &#X...;).

In the following example, the escape sequence "&#X3e;" should
have been replaced by "&gt;"

----------
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
	"http://www.w3.org/TR/html4/strict.dtd">
<html lang="en">
<head>
	<title>Escape sequences</title>
</head>
<body>
<p>#include &lt;stdio.h&gt;</p>
<p>#include &#60;stdio.h&#62;</p>
<p>#include &#x3C;stdio.h&#X3e;</p>
</body>
</html>
----------

Warning generated:

----------
Tidy (vers 4th August 2000) Parsing "escape_sequences.html"
line 11 column 26 - Warning: unescaped & or unknown entity "&#X3e"

escape_sequences.html: Doctype given is "-//W3C//DTD HTML 4.01//EN"
escape_sequences.html: Document content looks like HTML 4.01
1 warnings/errors were found!
----------



2 - doctype declaration
#######################

Tidy generates a warning if there is a new line between the
word PUBLIC and the string "-//W3C//DTD HTML 4.01//EN".

----------
<!DOCTYPE HTML PUBLIC
	"-//W3C//DTD HTML 4.01//EN"
    "http://www.w3.org/TR/html4/strict.dtd">
<html>
<head>
	<title>Title</title>
</head>
<body>
</body>
</html>
----------

Warning generated:

----------
Tidy (vers 4th August 2000) Parsing "doctype_not_ok.html"
line 12 column 1 - Warning: expected "html PUBLIC" or "html SYSTEM"

doctype_not_ok.html: Doctype given is "-//W3C//DTD HTML 4.01//EN"
doctype_not_ok.html: Document content looks like HTML 2.0
1 warnings/errors were found!
----------


sincerely,
jpo
-- 
Jose Pedro Garcia de Oliveira
mailto:jpo@di.uminho.pt * http://gsd.di.uminho.pt/~jpo
Received on Monday, 28 August 2000 00:58:25 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:44 GMT