W3C home > Mailing lists > Public > public-html@w3.org > September 2007

Re: the document character set for text/thml serialization

From: Julian Reschke <julian.reschke@gmx.de>
Date: Sun, 09 Sep 2007 16:11:34 +0200
Message-ID: <46E3FF16.20402@gmx.de>
To: Robert Burns <rob@robburns.com>
CC: HTML Working Group <public-html@w3.org>

Robert Burns wrote:
> ...
> I see now that XML 1.1 permits all of these control characters as part 
> of the document character set, however all of these ASCII control 
> characters must be included only as character references in XML 1.1. 
> That leaves only the issues of surrogates; whitespace handling for these 
> characters (if any: e.g., U+000B, U+000C, and U+0085). Though I think 
> our WGs practice of finding use cases for a feature before including it 
> is apt here too. Is being compatible with XML 1.1 enough of a use case? 
> How would authors use these characters?
> ...

My personal impression was that XML 1.1 is a failure; thus I wouldn't 
recommend HTML5 to rely on XML 1.1 features for the XML serialization.

We really should answer the question we asked before: why would it be 
conforming to include those characters in the first place?

Best regards, Julian
Received on Sunday, 9 September 2007 14:11:51 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 9 May 2012 00:16:07 GMT