
[css3-writing-modes] Character code histogram from aozora.gr.jp

From: Koji Ishii <kojiishi@gluesoft.co.jp>
Date: Sun, 20 May 2012 12:46:25 -0400
To: "www-style@w3.org" <www-style@w3.org>
Message-ID: <A592E245B36A8949BDB0A302B375FB4E0D5E3BEDF1@MAILR001.mail.lan>
For anyone who may be interested, I created a character code histogram from 10,873 works of Japanese literature on aozora.gr.jp[1].

The data is available in CSV[2].

Tables joined with other Unicode properties are also available in CSV[3] and XLSX[4]. In the joined tables, Han and PUA are omitted.
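
For readers who want to build a similar histogram from their own text, here is a minimal Python sketch. It is not the script used to produce the files above. The function names are hypothetical, the joined properties (General_Category and East Asian Width) are only an example selection, and the Han test is a rough approximation based on character names rather than the Script property.

```python
import unicodedata
from collections import Counter


def codepoint_histogram(texts):
    """Count how often each Unicode code point occurs across the given strings."""
    counts = Counter()
    for text in texts:
        counts.update(ord(ch) for ch in text)
    return counts


def is_han_or_pua(cp):
    """Approximate filter for Han ideographs and private-use characters.

    Assumption: private use is General_Category Co, and Han is detected by
    the ideograph name prefixes. This misses a few Script=Han characters
    such as U+3005.
    """
    ch = chr(cp)
    if unicodedata.category(ch) == "Co":
        return True
    name = unicodedata.name(ch, "")
    return name.startswith(("CJK UNIFIED IDEOGRAPH", "CJK COMPATIBILITY IDEOGRAPH"))


def joined_rows(counts):
    """Join counts with a few Unicode properties, omitting Han and PUA."""
    rows = []
    for cp, n in counts.most_common():
        if is_han_or_pua(cp):
            continue
        ch = chr(cp)
        rows.append((
            f"U+{cp:04X}",
            n,
            unicodedata.category(ch),
            unicodedata.east_asian_width(ch),
        ))
    return rows


if __name__ == "__main__":
    sample = ["吾輩は猫である。", "猫、猫。"]
    for row in joined_rows(codepoint_histogram(sample)):
        print(",".join(str(field) for field in row))
```

The property values come from the Unicode version bundled with the Python interpreter, so they may differ from tables built against another version of the Unicode Character Database.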

[1] http://en.wikipedia.org/wiki/Aozora_Bunko
[2] https://bitbucket.org/kojiishi/ucd/raw/tip/aozora.gr.jp.csv
[3] https://bitbucket.org/kojiishi/ucd/raw/tip/unicode.csv
[4] https://bitbucket.org/kojiishi/ucd/raw/tip/unicode.xlsx

Regards,
Koji
Received on Sunday, 20 May 2012 16:46:56 GMT
