[Bug 14993] The list of named character references at http://www.w3.org/TR/html5/named-character-references.html (8.5 Named character references) should also be available in an easy-to-parse format (e.g. plain text or json). This will allow developers to use it with

http://www.w3.org/Bugs/Public/show_bug.cgi?id=14993

David Carlisle <davidc@nag.co.uk> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |davidc@nag.co.uk

--- Comment #1 from David Carlisle <davidc@nag.co.uk> 2011-11-29 09:44:59 UTC ---
Note that the list is derived from the same source as the XML Entities spec;
the sources are at

http://www.w3.org/2003/entities/2007xml/

In particular, unicode.xml in that directory has all the information (but it
has a lot of other information too, so it might not be quite what you are
looking for).

The entities are available in DTD declaration format (so more or less plain
text) as

http://www.w3.org/2003/entities/2007/htmlmathml-f.ent

although that only has the XML-compatible ones (not the special-case HTML rules
that allow some common entity names to be used without a trailing ";").

It is essentially trivial to generate the same list in other formats by
modifying the XSLT that extracts htmlmathml-f.ent, which is available in the
XML source directory. I had assumed people would rather do that and generate
exactly the format they want (text, JSON, Python, whatever), but if there is a
generally useful format that I should generate, I have no objection to adding
it to the build and putting the generated files up at

http://www.w3.org/2003/entities/2007
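
For anyone who would rather post-process the DTD declarations than modify the
XSLT, here is a minimal sketch in Python. It assumes each declaration has the
shape <!ENTITY name "value" > and that every value is a sequence of numeric
character references, possibly double-escaped (e.g. "&#38;#38;"). The sample
lines and the function names are illustrative assumptions, not part of the
build.

```python
import json
import re

# Assumed declaration shape: <!ENTITY name "value" >, optionally followed by a comment.
ENTITY_RE = re.compile(r'<!ENTITY\s+(\S+)\s+"([^"]*)"\s*>')
# Numeric character reference, hexadecimal (&#x...;) or decimal (&#...;).
CHARREF_RE = re.compile(r'&#(x[0-9A-Fa-f]+|[0-9]+);')


def decode_charrefs(text):
    """Replace each numeric character reference in text with its character."""
    def repl(match):
        body = match.group(1)
        code = int(body[1:], 16) if body.startswith('x') else int(body)
        return chr(code)
    return CHARREF_RE.sub(repl, text)


def parse_entities(dtd_text):
    """Return a dict mapping entity name to replacement characters."""
    table = {}
    for name, value in ENTITY_RE.findall(dtd_text):
        # Decode twice so that a double-escaped value such as "&#38;#38;"
        # (which first decodes to "&#38;") ends up as the character itself.
        table[name] = decode_charrefs(decode_charrefs(value))
    return table


def to_json(table):
    """Serialise the table as ASCII-only JSON with sorted keys."""
    return json.dumps(table, indent=2, sort_keys=True)


# Illustrative sample lines written in the assumed declaration shape.
SAMPLE = '''
<!ENTITY Aacute           "&#x000C1;" ><!--LATIN CAPITAL LETTER A WITH ACUTE -->
<!ENTITY amp              "&#38;#38;" ><!--AMPERSAND -->
<!ENTITY acE              "&#x0223E;&#x00333;" >
'''

entities = parse_entities(SAMPLE)
print(to_json(entities))
```

The same table could be written out as plain text or as a Python literal
instead; JSON is used here only because it is one of the formats the bug
report asks for.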

Also, I'd need to know whether it is desired to list the ones without a
trailing ";".
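
If they were listed, one possible layout is to emit both spellings as separate
keys. A minimal sketch, assuming a small hand-picked subset of the names that
HTML accepts without the ";" (the set below is illustrative, not the full
list) and a hypothetical function name:

```python
# Illustrative subset only; the full set of names allowed without ";" is longer.
LEGACY_NO_SEMICOLON = {"amp", "lt", "gt", "quot", "nbsp", "copy"}


def expand_names(entities, legacy=LEGACY_NO_SEMICOLON):
    """Map "&name;" (and also "&name" for legacy names) to replacement text."""
    out = {}
    for name, chars in entities.items():
        out["&" + name + ";"] = chars
        if name in legacy:
            # Legacy names get a second key without the trailing semicolon.
            out["&" + name] = chars
    return out


example = expand_names({"amp": "&", "Aacute": "\u00c1"})
```

Keeping the "&" and ";" in the key makes the two spellings distinguishable
without needing a separate flag per entry.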


(Or maybe Ian will pick this up and I can do nothing, I don't mind:-)

David


Received on Tuesday, 29 November 2011 09:45:22 UTC