Re: [Json] Encoding detection

John Cowan writes:

> There are 68 of them on the Basic Multilingual Plane.  But many
> characters in other planes involve such 16-bit code units.  For example,
> all of U+10000 to U+103FF are encoded as D800 DC00 through D800 DFFF.
> Currently there are 622 characters in this range alone, and the number
> will probably grow.
>
>> Not sure about the status of U+4E00, one variant of the ideograph for
>> the numeral 1.
>
> Google reports over 3 gigahits for this character.

Thanks for the facts, much better than my suppositions.
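
For anyone who wants to check the surrogate arithmetic quoted above, here
is a minimal Python sketch of the standard UTF-16 encoding step. The
function name utf16_code_units is made up for illustration and is not from
this thread or from any library.

```python
from typing import Tuple


def utf16_code_units(cp: int) -> Tuple[int, ...]:
    """Return the UTF-16 code unit(s) for one Unicode scalar value.

    Hypothetical helper for illustration only.
    """
    if not 0 <= cp <= 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value: %#x" % cp)
    if cp < 0x10000:
        # BMP characters are a single 16-bit code unit.
        return (cp,)
    # Supplementary characters: split the 20-bit offset into two 10-bit halves.
    offset = cp - 0x10000
    high = 0xD800 + (offset >> 10)    # high (lead) surrogate
    low = 0xDC00 + (offset & 0x3FF)   # low (trail) surrogate
    return (high, low)


# The two ends of the range mentioned above, plus a BMP character.
for cp in (0x10000, 0x103FF, 0x4E00):
    units = " ".join("%04X" % u for u in utf16_code_units(cp))
    print("U+%04X -> %s" % (cp, units))
```

Every code point from U+10000 to U+103FF shares the high surrogate D800,
since the top ten bits of the offset are all zero across that range; only
the low surrogate varies, from DC00 to DFFF.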

ht
-- 
       Henry S. Thompson, School of Informatics, University of Edinburgh
      10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440
                Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk
                       URL: http://www.ltg.ed.ac.uk/~ht/
 [mail from me _always_ has a .sig like this -- mail without it is forged spam]

Received on Friday, 15 November 2013 08:46:09 UTC