W3C home > Mailing lists > Public > www-international@w3.org > October to December 2012

Re: For review: The byte-order mark (BOM) in HTML

From: Martin J. Dürst <duerst@it.aoyama.ac.jp>
Date: Thu, 20 Dec 2012 13:46:05 +0900
Message-ID: <50D2980D.5060601@it.aoyama.ac.jp>
To: John Cowan <cowan@mercury.ccil.org>
CC: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>, Albert Lunde <atlunde@panix.com>, www International <www-international@w3.org>
On 2012/12/20 13:13, John Cowan wrote:
> Leif Halvard Silli scripsit:
>
>> It seems impossible to improve the text unless Richard clarifies what
>> use the text has in mind.
>
> I agree that it's wrongly worded, but I believe the intent is clear.
> Here's my revision:
>
> "The UTF-8 encoding without a BOM has the property that a document
> which contains only characters from the US-ASCII range is encoded
> byte-for-byte the same way as the same document encoded using the
> US-ASCII encoding.  Such a document can be processed either as UTF-8 or
> as US-ASCII.  Adding a BOM inserts additional non-ASCII bytes, so this
> is no longer true."
>
> I believe that statement is correct, complete, and useful.

Very much so indeed.

Regards,   Martin.
Received on Thursday, 20 December 2012 04:46:37 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 20 December 2012 04:46:37 GMT