For anyone who may be interested, I created a character code histogram from 10,873 works of Japanese literature from aozora.gr.jp [1].

The data is available in CSV [2]. Tables joined with other Unicode properties are also available in CSV [3] and XLSX [4]. In the joined tables, Han and PUA (Private Use Area) characters are omitted.

[1] http://en.wikipedia.org/wiki/Aozora_Bunko
[2] https://bitbucket.org/kojiishi/ucd/raw/tip/aozora.gr.jp.csv
[3] https://bitbucket.org/kojiishi/ucd/raw/tip/unicode.csv
[4] https://bitbucket.org/kojiishi/ucd/raw/tip/unicode.xlsx

Regards,
Koji

Received on Sunday, 20 May 2012 16:46:56 UTC