- From: Henry S. Thompson <ht@inf.ed.ac.uk>
- Date: Fri, 15 Nov 2013 08:45:00 +0000
- To: John Cowan <cowan@mercury.ccil.org>
- Cc: "Joe Hildebrand \(jhildebr\)" <jhildebr@cisco.com>, "www-tag\@w3.org" <www-tag@w3.org>, Paul Hoffman <paul.hoffman@vpnc.org>, Pete Cordell <petejson@codalogic.com>, JSON WG <json@ietf.org>
John Cowan writes:
> There are 68 of them on the Basic Multilingual Plane. But many
> characters in other planes involve such 16-bit code units. For example,
> all of U+10000 to U+103FF are encoded as D800 DC00 through D800 DFFF.
> Currently there are 622 characters in this range alone, and the number
> will probably grow.
>
>> Not sure about the status of U+4E00, one variant of the ideograph for
>> the numeral 1).
>
> Google reports over 3 gigahits for this character.
Thanks for the facts, much better than my suppositions.
ht
--
Henry S. Thompson, School of Informatics, University of Edinburgh
10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440
Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk
URL: http://www.ltg.ed.ac.uk/~ht/
[mail from me _always_ has a .sig like this -- mail without it is forged spam]
Received on Friday, 15 November 2013 08:46:09 UTC