
[whatwg] HTML5 named entity ≫ and ≪

From: Mike Samuel <mikesamuel@gmail.com>
Date: Wed, 14 Dec 2011 12:18:30 -0500
Message-ID: <CACod6Gv=8tU4pSOdw7o_C4uXNJgcXjEik3qeRMDgBZSs+YcmMQ@mail.gmail.com>
The table in section 12.5 (
http://www.whatwg.org/specs/web-apps/current-work/multipage/named-character-references.html
) says
> GT;    U+0003E        >
> Gt;    U+0226B        ≫
> gt;    U+0003E        >
> GT     U+0003E        >
> gt     U+0003E        >

which I believe means that "&GT;", "&gt;", "&GT", and "&gt" all encode
">", but "&Gt;" encodes U+226B MUCH GREATER-THAN.
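That reading can be checked against Python's standard library. This is only a sketch: it assumes that the `html.entities.html5` table mirrors the spec's named character reference table, and Python is not mentioned anywhere in the original message.

```python
from html.entities import html5

# Keys omit the leading "&"; the legacy forms also omit the trailing ";".
GT_VARIANTS = ("GT;", "Gt;", "gt;", "GT", "gt")

def codepoints(name):
    """Return the code points a named reference expands to, as U+XXXXX strings."""
    return [f"U+{ord(ch):05X}" for ch in html5[name]]

for name in GT_VARIANTS:
    print(f"{name:<4} {' '.join(codepoints(name))}")
```

Only the mixed-case "Gt;" should report U+0226B; the other four spellings should report U+0003E.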

http://svn.whatwg.org/webapps/entities-unicode.inc includes these but
the entities-legacy.inc does not.

Similarly

> Lt;    U+0226A        ≪

This is a potential source of confusion for naive HTML entity decoders
that fall back to case-insensitive matching when there is no mapping for
a given entity name.
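A minimal sketch of that failure mode follows. The decoder, its lowercase-only table, and the names `naive_decode` and `strict_decode` are all hypothetical, chosen to illustrate the fallback; they do not describe any particular real decoder.

```python
import re
from html.entities import html5

# Matches semicolon-terminated named references such as "&Gt;".
ENTITY = re.compile(r"&([A-Za-z][A-Za-z0-9]*;)")

# Hypothetical table for a decoder that only knows all-lowercase names.
LOWERCASE_ONLY = {k: v for k, v in html5.items() if k == k.lower()}

def naive_decode(text, table=LOWERCASE_ONLY):
    """Decode named references, falling back to a case-insensitive lookup."""
    def replace(match):
        name = match.group(1)
        if name in table:
            return table[name]
        # The risky step: an unknown name is retried in lowercase.
        return table.get(name.lower(), match.group(0))
    return ENTITY.sub(replace, text)

def strict_decode(text):
    """Decode named references using exact, case-sensitive names only."""
    return ENTITY.sub(lambda m: html5.get(m.group(1), m.group(0)), text)

print(naive_decode("a &Gt; b"))   # "&Gt;" is silently treated as "&gt;"
print(strict_decode("a &Gt; b"))  # "&Gt;" stays U+226B
```

Under these assumptions the two functions disagree on "&Gt;" and "&Lt;", and the naive one produces markup-significant characters where the author wrote a math symbol.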

MathML already has other succinct mappings for U+226A (&ll;) and
U+226B (&gg;).  Could HTML5 avoid confusion by deprecating &Lt; and
&Gt; in favor of &ll; and &gg; or remove them entirely?
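The equivalence of the short names can be confirmed the same way. This sketch again assumes that Python's `html.unescape` follows the HTML5 table; the `PAIRS` name is illustrative only.

```python
from html import unescape

# (short name, easily confused name) pairs from the paragraph above.
PAIRS = (("&ll;", "&Lt;"), ("&gg;", "&Gt;"))

for short, confusable in PAIRS:
    same = unescape(short) == unescape(confusable)
    print(f"{short} {confusable} U+{ord(unescape(short)):04X} same={same}")
```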

http://www.google.com/codesearch#search/&q=&amp;Gt;%20file:.html$%20case:yes&type=cs
shows four files using "&Gt;", two of which treat it as synonymous with "&gt;".
Received on Wednesday, 14 December 2011 09:18:30 UTC
