W3C home > Mailing lists > Public > whatwg@whatwg.org > March 2006

[whatwg] Internal character encoding declaration

From: Henri Sivonen <hsivonen@iki.fi>
Date: Sun, 12 Mar 2006 16:46:13 +0200
Message-ID: <3B462A81-B86C-4F9C-B087-CA2FC41877C8@iki.fi>
On Mar 12, 2006, at 00:49, Henri Sivonen wrote:

>> Encoding errors are easy parse errors. (Emit U+FFFD on bogus data.)
>
> Except for the ISO-8859-* family the easy error recovery should be  
> emitting the characters according to the corresponding Windows-*  
> family superset.

But those aren't strictly encoding errors. One more try:

For ISO-8859-* family encodings that have a corresponding Windows-*  
family superset (e.g. Windows-1252 for ISO-8859-1) the UA must use  
the Windows-* family superset decoder instead of the ISO-8859-*  
family decoder. However, any bytes in the 0x80?0x9F (inclusive) are  
easy parse errors.

-- 
Henri Sivonen
hsivonen at iki.fi
http://hsivonen.iki.fi/
Received on Sunday, 12 March 2006 06:46:13 UTC

This archive was generated by hypermail 2.4.0 : Wednesday, 22 January 2020 16:58:45 UTC