comment: upgrading to Unicode

FAQ: Upgrading from language-specific legacy encoding to Unicode encoding

http://www.w3.org/International/questions/qa-utf8-upgrade.en.php

"In addition, many legacy encodings for complex scripts are already 
double-byte, eg, Chinese."

"Characters that do not fall into the ASCII range, such as Chinese, 
Arabic, Russian, may use 2 or even 3 bytes. Chinese encodings already 
use more than 1 byte per character with legacy encodings, where they use 
double bytes."

These would be more accurate if revised to:

"In addition, many legacy encodings for complex scripts are already 
multibyte, eg, Chinese."

"Characters that do not fall into the ASCII range, such as Chinese, 
Arabic, Russian, may use 2, 3 or even 4 bytes. Chinese encodings already 
use more than 1 byte per character with legacy encodings, where they 
normally use 2 or 3 bytes."

Regards,
-Dan

Received on Tuesday, 28 October 2008 18:07:12 UTC