W3C home > Mailing lists > Public > public-html-comments@w3.org > September 2009

Re: What makes illegal characters non-conformant

From: Henry S. Thompson <ht@inf.ed.ac.uk>
Date: Wed, 23 Sep 2009 19:54:08 +0100
To: Geoffrey Sneddon <foolistbar@googlemail.com>
Cc: public-html-comments@w3.org
Message-ID: <f5by6o5h4q7.fsf@hildegard.inf.ed.ac.uk>
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Geoffrey Sneddon writes:

> On 23 Sep 2009, at 15:12, Henry S. Thompson wrote:
>
>> although http://www.ltg.ed.ac.uk/~ht/char_alias.xml
>> is _not_ broken per the XML specification. . .
>
> It should be, per:
>
>> It is a fatal error if an XML entity is determined (via default,
>> encoding declaration, or higher-level protocol) to be in a certain
>> encoding but contains byte sequences that are not legal in that
>> encoding.

You're right, I was mistaken.

> That said, though processors must throw a fatal error, I can't see
> anything saying the document isn't well-formed (bug?).

Hmm.

ht
- -- 
       Henry S. Thompson, School of Informatics, University of Edinburgh
                         Half-time member of W3C Team
      10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440
                Fax: (44) 131 651-1426, e-mail: ht@inf.ed.ac.uk
                       URL: http://www.ltg.ed.ac.uk/~ht/
[mail really from me _always_ has this .sig -- mail without it is forged spam]
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)

iD8DBQFKum7QkjnJixAXWBoRAl7dAJ9YERQmccq5h1FQC+/y+8ya5DRfcwCghAT2
rfoIGs4VEOSoEQ8HKz23Yc8=
=MnAk
-----END PGP SIGNATURE-----
Received on Wednesday, 23 September 2009 18:54:45 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 1 June 2011 00:14:00 GMT