Re: 求助:關於Big5和Big5-HKSCS的問題

(12/04/17 19:57), Kang-Hao (Kenny) Lu wrote:
> (12/04/17 19:40), Bob Chao wrote:
>> 或許開個 Google spreadsheet 大家上去幫忙人肉比對一下?
> 
> 歡迎抽空幫忙,目前就是[1]和[2]而已。不過我在想辦法造出新的(含有日文的)。
> 
> [1] (.tw)
> https://gitorious.org/whatwg/big5/blobs/master/big5-hkscs-vs-uao.txt
> [2] (.hk) http://www.w3.org/html/ig/zh/wiki/Big5-hkscs-vs-uao-in-hk

剛剛阿菲跟我講它這部份其實都做得差不多了,不過 .hk 裡面有七個他分不出來
的就請大家幫忙一下好了。可能需要人肉搜尋、推理:

1.
http://www.cepa.com.hk/office/officeAdminFurnitureFactory.asp

bytes: FCF8

hkscs: 超<U+6822 栢>(香港)有限公司&nbsp;

uao:   超<U+8063 聣>(香港)有限公司&nbsp;

Link to Superb (HK) Limited <http://www.superbhk.com/>, which is dead.

archive.org has copies, but none with the Chinese name.


2.
http://www.budaedu.org.hk/budaedu/qm-04.html

bytes: 9EA4

hkscs: ,不許改革。彼預與其族叔祖楚�<U+48F3 䣳>翩A議其辦法。�<U+48F3 䣳>
翩A乃明理通人,極為贊成。遂於

uao:   ,不許改革。彼預與其族叔祖楚�<U+92F6 鋶>翩A議其辦法。�<U+92F6 鋶>
翩A乃明理通人,極為贊成。遂於

Weird error handling, both Opera and Firefox actually display:

彼預與其族叔祖楚�公,議其辦法。�公,乃明理通人,極為贊成。

彼預與其族叔祖楚�公,議其辦法。�公,乃明理通人,極為贊成。

Original bytes are:

b'\xa9\xbc\xb9w\xbbP\xa8\xe4\xb1\xda\xa8\xfb\xaf\xaa\xb7\xa1\xd1\x9e\xa4\xbd\xa1A\xc4\xb3\xa8\xe4\xbf\xec\xaak\xa1C\xd1\x9e\xa4\xbd\xa1A\xa4D\xa9\xfa\xb2z\xb3q\xa4H\xa1A\xb7\xa5\xac\xb0\xc3\xd9\xa6\xa8\xa1C'


3.
http://www.ychlccsc.edu.hk/info07.html

bytes: 8E50

hkscs: ng>內設可供燒製大型壁畫的陶<U+7AB0 窰>電動拉坯機、坭板機、真空混坭機

uao:   ng>內設可供燒製大型壁畫的陶<U+8BD6 诖>電動拉坯機、坭板機、真空混坭機

Typo of 密?


4.
http://www.budaedu.org.hk/budaedu/shwd-02.html

bytes: 9FA5

hkscs: 《補史記‧三皇本紀》說:『太�<U+4C3B 䰻>]犧氏,風姓,代燧人氏繼天而王

uao:   《補史記‧三皇本紀》說:『太�<U+7F37 缷>]犧氏,風姓,代燧人氏繼天而王

Opera and Firefox say:

『太�包犧氏,風姓,代燧人氏繼天而王。母曰華胥,履大人跡於雷澤,而生庖犧
於成紀。蛇身人首,有聖德。』

Original bytes:

b'\xa1y\xa4\xd3\xc6\x9f\xa5]\xc4\xeb\xa4\xf3\xa1A\xad\xb7\xa9m\xa1A\xa5N\xc0\xe6\xa4H\xa4\xf3\xc4~\xa4\xd1\xa6\xd3\xa4\xfd\xa1C\xa5\xc0\xa4\xea\xb5\xd8\xadE\xa1A\xbci\xa4j\xa4H\xb8\xf1\xa9\xf3\xb9p\xbfA\xa1A\xa6\xd3\xa5\xcd\xa9\xb4\xc4\xeb\xa9\xf3\xa6\xa8\xac\xf6\xa1C\xb3D\xa8\xad\xa4H\xad\xba\xa1A\xa6\xb3\xb8t\xbcw\xa1C\xa1z'


5.
http://www.htsps.edu.hk/Web/03_TeachLearn/chi.htm

bytes: FCDF

hkscs: 楊佩汶</td><td>4B趙<U+6667 晧>峻</td><td>3B梁文威

hkscs: 李承恩</td><td>3B趙<U+6667 晧>崚</td></tr>

hkscs: 李承恩</td><td>3B趙<U+6667 晧>崚</td></tr>

uao:   楊佩汶</td><td>4B趙<U+7BCF 篏>峻</td><td>3B梁文威

uao:   李承恩</td><td>3B趙<U+7BCF 篏>崚</td></tr>

uao:   李承恩</td><td>3B趙<U+7BCF 篏>崚</td></tr>

See http://www.htsps.edu.hk/Web/03_TeachLearn/eng.htm


6.
http://www.mihk.hk/forum/thread-962765-1-1.html

bytes: FC70

hkscs: t> 下1湯匙油燒熱鍋,爆香乾<U+27A53 𧩓>片,下芋頭略煮片刻,灒酒略拌勻

uao:   t> 下1湯匙油燒熱鍋,爆香乾<U+E16A >片,下芋頭略煮片刻,灒酒略拌勻

Same character in some fonts, but UAO uses PUA mapping. Possible typo.


7.
http://www.htsps.edu.hk/Web/03_TeachLearn/eng.htm

bytes: FCDF

hkscs: t-family:標楷體'>趙<U+6667 晧>峻</span></p>

uao:   t-family:標楷體'>趙<U+7BCF 篏>峻</span></p>

Given name Chiu Ho Tsun: 趙晧峻 or 趙篏峻?



此致

Kenny

Received on Tuesday, 17 April 2012 12:13:36 UTC