[css3-writing-modes] Character code histogram from aozora.gr.jp from Koji Ishii on 2012-05-20 (www-style@w3.org from May 2012)

From: Koji Ishii <kojiishi@gluesoft.co.jp>
Date: Sun, 20 May 2012 12:46:25 -0400
To: "www-style@w3.org" <www-style@w3.org>
Message-ID: <A592E245B36A8949BDB0A302B375FB4E0D5E3BEDF1@MAILR001.mail.lan>

For anyone who may be interested in, I created character code histogram from 10,873 Japanese literature from aozora.gr.jp[1].

The data is here in CSV[2].

Tables joined with other Unicode properties are also available in CSV[3] and XLSX[4]. In the joined tables, Han and PUA are omitted.

[1] http://en.wikipedia.org/wiki/Aozora_Bunko
[2] https://bitbucket.org/kojiishi/ucd/raw/tip/aozora.gr.jp.csv
[3] https://bitbucket.org/kojiishi/ucd/raw/tip/unicode.csv
[4] https://bitbucket.org/kojiishi/ucd/raw/tip/unicode.xlsx

Regards,
Koji

Received on Sunday, 20 May 2012 16:46:56 UTC