- From: Henry S. Thompson <ht@inf.ed.ac.uk>
- Date: Fri, 15 Nov 2013 08:45:00 +0000
- To: John Cowan <cowan@mercury.ccil.org>
- Cc: "Joe Hildebrand \(jhildebr\)" <jhildebr@cisco.com>, "www-tag\@w3.org" <www-tag@w3.org>, Paul Hoffman <paul.hoffman@vpnc.org>, Pete Cordell <petejson@codalogic.com>, JSON WG <json@ietf.org>
John Cowan writes: > There are 68 of them on the Basic Multilingual Plane. But many > characters in other planes involve such 16-bit code units. For example, > all of U+10000 to U+103FF are encoded as D800 DC00 through D800 DFFF. > Currently there are 622 characters in this range alone, and the number > will probably grow. > >> Not sure about the status of U+4E00, one variant of the ideograph for >> the numeral 1). > > Google reports over 3 gigahits for this character. Thanks for the facts, much better than my suppositions. ht -- Henry S. Thompson, School of Informatics, University of Edinburgh 10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440 Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk URL: http://www.ltg.ed.ac.uk/~ht/ [mail from me _always_ has a .sig like this -- mail without it is forged spam]
Received on Friday, 15 November 2013 08:46:09 UTC