
Re: [www-validator] <none>

From: Lachlan Hunt <lachlan.hunt@lachy.id.au>
Date: Fri, 15 Apr 2005 11:56:46 +1000
Message-ID: <425F1F5E.8010507@lachy.id.au>
To: Dave Lumley <davidlumley@msn.com>
CC: www-validator@w3.org

Dave Lumley wrote:
> Line 1, column 0: character "ï" not allowed in prolog
> 
> ï»¿<html><!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 
> Transitional//EN">

When I validate it, I don't get that error message.  However, I do get 
dozens of others.

> I look in my file and there is nothing preceding my first line and 
> <html> tag

That is the first problem.  The DOCTYPE *should* be the first thing in 
the file, preceded only by whitespace or a byte order mark (BOM). 
(Technically, other things may precede it too, but you need not worry 
about them, since using them sends some browsers into quirks mode.) 
The <html> start-tag must follow the DOCTYPE.
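
The rule above can be checked mechanically.  What follows is only a 
minimal sketch in Python, not how the validator does it; the function 
name starts_with_doctype is made up for illustration, and it assumes 
the file has been read as raw bytes.

```python
import codecs


def starts_with_doctype(data: bytes) -> bool:
    """Return True if the document begins with a DOCTYPE declaration.

    Only a UTF-8 BOM and leading whitespace are allowed before it.
    (Hypothetical helper for illustration; not part of any validator.)
    """
    # Skip the 3-octet UTF-8 BOM if the file starts with one.
    if data.startswith(codecs.BOM_UTF8):
        data = data[len(codecs.BOM_UTF8):]
    # Skip leading whitespace, then compare case-insensitively.
    data = data.lstrip(b" \t\r\n\x0c")
    return data.upper().startswith(b"<!DOCTYPE")


good = codecs.BOM_UTF8 + b'<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">\n<html lang="en">'
bad = codecs.BOM_UTF8 + b'<html><!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">'

print(starts_with_doctype(good))
print(starts_with_doctype(bad))
```

The second input mirrors the start of your file, where <html> comes 
before the DOCTYPE, so the check fails for it.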

If you wish to continue to use a Transitional DOCTYPE (though I 
recommend you use Strict and stop using deprecated features), then make 
this the first part of the file:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
     "http://www.w3.org/TR/html4/loose.dtd">
<html lang="en">

Including the URI in the DOCTYPE ensures that browsers will try to 
render the document in standards mode (or almost standards mode), rather 
than the buggy quirks mode designed to be backwards compatible with the 
broken behaviour of Netscape and IE 4.x.  Because your page currently 
triggers quirks mode, doing this may change how it renders, but that is 
for the better because the browsers will be following the standards more 
closely.

> the uri is www.phoenix-divers.com/index.html

It seems the very first character in the file is a UTF-8 BOM.  These 
three symbols: ï»¿ (which are actually the 3-octet UTF-8 BOM) appear 
when viewing the site because the character encoding has been 
incorrectly declared as ISO-8859-1.
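
To see why the BOM turns into visible symbols, here is a short Python 
sketch (the variable names are only illustrative).  It decodes the same 
three octets first as ISO-8859-1 and then as UTF-8.

```python
import codecs

bom = codecs.BOM_UTF8  # the three octets EF BB BF

# Read as ISO-8859-1, each octet becomes its own visible character.
as_latin1 = bom.decode("iso-8859-1")

# Read as UTF-8, the three octets form the single character U+FEFF.
as_utf8 = bom.decode("utf-8")

# Python's "utf-8-sig" codec strips a leading BOM while decoding.
as_utf8_sig = bom.decode("utf-8-sig")

print(bom.hex())
print(as_latin1)
print(len(as_utf8), len(as_utf8_sig))
```

Declaring the encoding as UTF-8, or saving the file without a BOM, 
avoids the stray characters.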

These problems are discussed in more detail in my three-part guide to Unicode.
http://lachy.id.au/log/2004/12/guide-to-unicode-part-1
http://lachy.id.au/log/2004/12/guide-to-unicode-part-2
http://lachy.id.au/log/2005/01/guide-to-unicode-part-3

-- 
Lachlan Hunt
http://lachy.id.au/
http://GetFirefox.com/     Rediscover the Web
http://GetThunderbird.com/ Reclaim your Inbox
Received on Friday, 15 April 2005 01:56:51 GMT
