- From: Richard A. O'Keefe <ok@atlas.otago.ac.nz>
- Date: Mon, 31 Jul 2000 12:48:48 +1200 (NZST)
- To: Barney.Wol@noctua.demon.co.uk, html-tidy@w3.org
Barney Wol <Barney.Wol@noctua.demon.co.uk> wrote:
Another contributor to the list recently pointed out that the
HTML spec actually specifies the DOS standard CR/LF format for best
compatibility. Even though there is a small file-size overhead in
this old "teletype" standard, it is none-the-less the standard, and
said overhead is really quite small.
Well no, it doesn't. Here's what it says:
On the Internet, some platforms use carriage return line feed pairs
for line breaks, some use just line feeds, and others just carriage
returns. As a result, HTML user agents should consider both
isolated line feed and carriage return chracters as line breaks,
with carriage return line feed pairs treated as single line breaks.
That is,
<new line> = CR LF | CR {not followed by LF} | LF.
Concerning line break after <img src="...>,
the specification goes on to say in the very next paragraph
A line break occurring immediately following a start tag
should be discarded, as should a line break occurring immediately
before an end tag. This applies to all HTML elements without
exception.
That is to say, any HTML user agent (browser) that treats
"<img src='foo/bar'>
and the best of luck" in any way differently from
"<img src='foo/bar'>and the best of luck"
is ***wrong***, "without exception" a line break "should be discarded"
"immediately following a start tag" such as <img...>.
Tidy is within its rights putting a line break there.
Since there are broken browsers out there, it may be unwise for Tidy
to insist on its rights, but let's be clear about where the fault lies.
Received on Sunday, 30 July 2000 20:49:03 UTC