[Bug 14993] The list of named character references at http://www.w3.org/TR/html5/named-character-references.html (8.5 Named character references) should also be available in an easy-to-parse format (e.g. plain text or json). This will allow developers to use it with

From: <bugzilla@jessica.w3.org>
Date: Tue, 29 Nov 2011 09:44:59 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1RVKFH-0001sR-Nd@jessica.w3.org>

David Carlisle <davidc@nag.co.uk> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |davidc@nag.co.uk

--- Comment #1 from David Carlisle <davidc@nag.co.uk> 2011-11-29 09:44:59 UTC ---
Note that the list is derived from the same source as the XML Entities spec,
namely the sources at


In particular, unicode.xml in that directory has all the information (but it
has a lot of other information too, so it might not be quite what you are
looking for).

The entities are available in DTD declaration format (so more or less plain
text) as


although that only has the XML-compatible ones (not the special-case HTML rules
that allow some common entity names to be used without a trailing ";").

It is essentially trivial to generate the same list in other formats by
modifying the XSLT that extracts htmlmathml-f.ent, which is available in the
XML source directory, so I had assumed people would rather do that and generate
exactly the format they want (text, JSON, Python, whatever). But if there is a
generally useful format that I should generate, I have no objection to adding
that to the build and putting the generated files up at


Also, I'd need to know whether it is desired to list the ones that may be used
without a trailing ";".
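
For anyone who would rather post-process the DTD declarations than touch the
XSLT, here is a rough sketch in Python of turning them into JSON. It is only an
illustration under some assumptions: that each declaration looks like
<!ENTITY name "&#xHHHH;" > with one or more numeric character references as
the value, and that the output shape (a map from "&name;" to code points and
characters) is what is wanted. The names parse_entities and SAMPLE are made up
for this example and are not part of the build, and SAMPLE is a hand-written
stand-in, not an excerpt from the generated file.

```python
import json
import re

# Matches declarations such as: <!ENTITY Aacute "&#x000C1;" >
ENTITY_DECL = re.compile(r'<!ENTITY\s+([A-Za-z][A-Za-z0-9.]*)\s+"([^"]*)"\s*>')
# Matches one numeric character reference, hexadecimal or decimal.
CHAR_REF = re.compile(r'&#(?:[xX]([0-9A-Fa-f]+)|([0-9]+));')

# Hand-written stand-in input (assumption), not copied from the real file.
SAMPLE = '''
<!ENTITY Aacute "&#x000C1;" ><!--LATIN CAPITAL LETTER A WITH ACUTE -->
<!ENTITY NotEqualTilde "&#x02242;&#x00338;" >
<!ENTITY lt "&#38;#60;" >
'''


def parse_entities(dtd_text):
    """Return a dict mapping "&name;" to its code points and characters."""
    entities = {}
    for name, value in ENTITY_DECL.findall(dtd_text):
        # XML declares lt and amp double-escaped ("&#38;#60;"); undo one
        # level so they resolve to "<" and "&" (assumes no other use of
        # "&#38;#" in the values).
        value = value.replace("&#38;#", "&#")
        code_points = [
            int(hex_digits, 16) if hex_digits else int(dec_digits)
            for hex_digits, dec_digits in CHAR_REF.findall(value)
        ]
        if not code_points:
            continue  # skip values that are not numeric references
        entities["&" + name + ";"] = {
            "codepoints": code_points,
            "characters": "".join(chr(cp) for cp in code_points),
        }
    return entities


if __name__ == "__main__":
    print(json.dumps(parse_entities(SAMPLE), indent=2, sort_keys=True))
```

Like the DTD file itself, this only covers the XML-compatible names; the HTML
names that are allowed without a trailing ";" would have to come from somewhere
else.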

(Or maybe Ian will pick this up and I can do nothing, I don't mind:-)


Received on Tuesday, 29 November 2011 09:45:22 UTC
