W3C home > Mailing lists > Public > public-html@w3.org > August 2007

Entities (part of detailed review)

From: Jirka Kosek <jirka@kosek.cz>
Date: Wed, 01 Aug 2007 12:13:26 +0200
Message-ID: <46B05CC6.2090506@kosek.cz>
To: "public-html@w3.org" <public-html@w3.org>
Hi,

please find below my comments to section 8.6. Entities
(http://www.w3.org/html/wg/html5/#entities).

#1
The table could be very confusing as many of entities are presented
twice with and without trailing semicolon. At the first sight I thought
this was just typo but it seems that reason for this is to mimics legacy
behavior of entity processing. But if someone will just follow the
reference to this section he or she could be very easily confused.

Because of this, I think that there should be note clarifying why some
entities are presented twice in the table and pointing to an appropriate
part of parsing algorithm, probably 8.2.3.1 Tokenising entities
(http://www.w3.org/html/wg/html5/#tokenising).

#2
For better readability table should be extended and for each entity name
of character and sample rendering should be added. Something like

|AElig;|U+00C6|Æ|LATIN CAPITAL LETTER AE|

#3
Entities are not generally supported in XHTML serialization and thus
this section should explicitly say that entities are supported only in
HTML seralization to prevent confusion.

Jirka

-- 
------------------------------------------------------------------
  Jirka Kosek      e-mail: jirka@kosek.cz      http://xmlguru.cz
------------------------------------------------------------------
       Professional XML consulting and training services
  DocBook customization, custom XSLT/XSL-FO document processing
------------------------------------------------------------------
 OASIS DocBook TC member, W3C Invited Expert, ISO JTC1/SC34 member
------------------------------------------------------------------


Received on Wednesday, 1 August 2007 10:12:54 UTC

This archive was generated by hypermail 2.3.1 : Monday, 29 September 2014 09:38:47 UTC