Update to unicode.xml

If you don't follow updates to the unicode.xml file used as the source 
for entity definitions in mathml and html
skip this message:-)

Christian's recent  questions about unicode.xml (and some recent bug 
reports about unicode-math latex package)
prompted me to look again at the tex mappings in unicode.xml.

The existing ones were mostly speculative assignments dating from the 
1990's some years before the bulk of
math characters  were added to Unicode.

I have extended the schema to allow multiple <latex> and <mathlatex> 
elements so the file can track different mappings,
and added a set attribute do distinguish these. So that now for example 
there are entries such as

          <mathlatex set="unicode-math">\rightarrow</mathlatex>

for U+"2192" .

This  mathlatex set="unicode-math" set is mechanically extracted from 
the source of the unicode-math package
(the principle method for using unicode math fonts with xelatex and 
lualatex)
https://github.com/wspr/unicode-math/blob/master/unicode-math-table.tex

So, while I'm not sure I like all the mappings here they correspond to 
running TeX code which is a definite improvement
over the previous ones.

Frédéric Wang reported some problems with the TeX mappings a while back 
I haven't fixed those yet, I may just remove them
in favour of this new set, or perhaps this set and a set derived from a 
package for classic TeX  (amssymb or stix-latex)
comments welcome on the best plan of action here....

While I was extending the schema anyway I added an attribute to the top 
level element  that records which version of
Unicode the data is supposed to match. (currently 6.3)

David

http://www.w3.org/2003/entities/2007xml/

Received on Sunday, 22 June 2014 17:31:40 UTC