arXiv updates and MathML Intent

Hi everyone,

As of this morning, arXiv's experimental HTML pages are now using an intent
attribute for the first  time. Namely, each math element is annotated with
the ":literal" property.

":literal" values are the only reliable starting point for graduate level
texts in STEM, where simple heuristics fail as often as they succeed (if
that). They fit Louis' pragmatic stance of "just show me what the author
has written", while also allowing more informed intent attributes to be
deposited on the subtrees, when known.

Additional intent upgrades are interesting future work for arXiv - and
LaTeXML. We would like to gradually enrich subtrees, whenever we have
sufficient confidence we can infer the mathematical concepts.

These new attributes for arXiv are only starting to trickle in with the
article batches announced daily. It will take a full regeneration of the
collection to have them everywhere. That could happen near the end of 2025,
maybe just in time for the MathML 4 CR phase. We'll see.

Here is one random recently announced article, to illustrate today's
changes:
https://arxiv.org/html/2509.12354v1

P.S. My regrets for tomorrow's WG meeting, I will be traveling.

Greetings,
Deyan

Received on Wednesday, 17 September 2025 21:40:27 UTC