Extension Specification for XHTML5 entity definitions.

On 09/01/2013 16:00, Sam Ruby wrote:

> 3) Bugs resolved by Ian but should be reopened (for 5.1) in order to
>  get our new editor team to take a position on the matter:
> https://www.w3.org/Bugs/Public/show_bug.cgi?id=13409
> https://www.w3.org/Bugs/Public/show_bug.cgi?id=15213
> https://www.w3.org/Bugs/Public/show_bug.cgi?id=18384
> Note: for the bugs in the third bucket, it should still be possible
> for the function to be included in HTML 5.0, should an extension
> spec be written and progress be made towards satisfying the exit
> criteria:

In conjunction with the Math WG I have prepared an extension document
proposing to resolve bug 13409 by adding an additional public ID to the
list of recognised XHTML PUBLIC identifiers listed in the HTML


In short the situation in XHTML5 as currently specified is that:

if you use an HTML5 entity such as &AMP;

then you get a fatal document parse error if you specify a DTD that
defines AMP, but the document parses and AMP is defined if you specify
the XHTML 1 or MathML2 DTD (neither of which defines AMP).

Conversely, if you use a MathML entity such as the one for ⟹ (U+27F9),
it will work if you specify the XHTML 1 DTD (which does not define it)
but will not work if you specify the currently supported W3C entity set,
which does.

This is clearly a confusing and sub-optimal situation that leads to data
corruption if documents are moved between XHTML user agents and standard
XML tool-chains.
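
As a rough illustration of the tool-chain side of this, here is a minimal
sketch using Python's standard xml.etree.ElementTree parser as a stand-in
for "a standard XML tool-chain" (my choice of parser, not anything the
spec mandates). The helper name try_parse is hypothetical. With no DTD
supplying a definition, the predefined XML entity is accepted, while the
HTML5 name AMP is rejected as undefined:

```python
import xml.etree.ElementTree as ET


def try_parse(source):
    """Parse an XML string; return ("ok", root text) or ("error", message)."""
    try:
        return ("ok", ET.fromstring(source).text)
    except ET.ParseError as err:
        return ("error", str(err))


# &amp; is one of the five entities predefined by XML itself.
predefined = try_parse("<p>&amp;</p>")

# &AMP; is an HTML5 name; nothing here declares it, so a plain XML
# parser treats the reference as a well-formedness error.
html5_name = try_parse("<p>&AMP;</p>")

print(predefined)
print(html5_name[0])
```

An XHTML user agent that recognises the document's PUBLIC identifier
would instead resolve the name from its built-in table, which is where
the two behaviours diverge.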

The Entities in MathML3 and HTML5 are derived from the definitions in
the W3C XML Entity Names Recommendation


and this extension specification proposes adding the PUBLIC identifier
of the HTML5/MathML3 entity set to the list of identifiers recognised by
XHTML user agents.


Note that this is an "un-versioned" identifier, so this is a one-time
bug-fix addition: the list does not need to be extended for each new
version of HTML. Adding this one line is the _sole_ change proposed by
this extension specification.
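
To give implementers a feel for the size of the change, here is a
minimal sketch of the kind of lookup involved, assuming the user agent
compares the doctype's PUBLIC identifier by exact string match against a
fixed set. All names here (RECOGNISED_PUBLIC_IDS, PROPOSED_PUBLIC_ID,
uses_builtin_entities) are hypothetical, the set shows only two
illustrative entries for the XHTML 1 and MathML2 DTDs rather than the
full list, and the proposed identifier is left as a placeholder string
rather than spelled out:

```python
# Illustrative subset only; a real user agent carries the full list.
RECOGNISED_PUBLIC_IDS = frozenset({
    "-//W3C//DTD XHTML 1.0 Strict//EN",
    "-//W3C//DTD MathML 2.0//EN",
})

# Placeholder standing in for the identifier of the HTML5/MathML3
# entity set; substitute the real value from the extension document.
PROPOSED_PUBLIC_ID = "<proposed HTML5/MathML3 entity set PUBLIC identifier>"


def uses_builtin_entities(public_id, recognised=RECOGNISED_PUBLIC_IDS):
    """Return True if the doctype's PUBLIC id selects the built-in entity table."""
    return public_id in recognised


# Today: the proposed identifier is not recognised.
before = uses_builtin_entities(PROPOSED_PUBLIC_ID)

# With the extension: the same lookup, one more entry in the set.
extended = RECOGNISED_PUBLIC_IDS | {PROPOSED_PUBLIC_ID}
after = uses_builtin_entities(PROPOSED_PUBLIC_ID, extended)

print(before, after)
```

Under these assumptions the change amounts to one additional entry in
the set; the lookup logic itself is untouched.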

Before progressing this bug it would be good to get some feedback from 
implementers: would adding


to the list of recognised doctype PUBLIC identifiers be
impossibly hard, trivially easy, or inconvenient but you'd do it if
the spec changed?



Received on Thursday, 17 January 2013 17:20:40 UTC