W3C home > Mailing lists > Public > www-international@w3.org > July to September 2001

RE: Unicode <-> CJKV national encoding; supporting multi-lingualwebcontent

From: Carl W. Brown <cbrown@xnetinc.com>
Date: Mon, 20 Aug 2001 13:21:55 -0700
To: <www-international@w3.org>
Message-ID: <FNEHIHOMIIDPDCIFEJEGKEHDCIAA.cbrown@xnetinc.com>
Yung,

I have added GB18030 support to xIUA.  It looks like it GB18030 is fairly
complete except I could not find the mappings for Yi or Hangul symbols
U+A000 to U+D7A3.

While Yi is a minor language, Korean is not.  I think that a multilingual
solution should include full Korean support.

Carl


>
> > > > > The only way to sanely implement a multi-lingual site is using
> > > > Unicode.
> > I think you can also use GB18030 these days, if you care about
> > GB2312 back ward
> > compatability.
>
> I don't expect to see GB18030 to pick up the range of non-Chinese scripts
> that Unicode has.  Yes the assignments are automatic but it practice will
> they be used?  GB18030 is more awkward to process that most MBCS scripts.
> You can not always test the character length from the first character.  It
> is also difficult to determine the start of a character when you access a
> string randomly.
>
> ICU http://oss.software.ibm.com/icu/ supports GB18030
> http://www-106.ibm.com/developerworks/library/u-china.html?dwzone=unicode
> but I have not added yet added support in xIUA.
> http://www.xnetinc.com/xiua/ Maybe you can help me.  It there a good
> algorithm for finding the start of a character without going to the
> beginning of the string?
>
>
Received on Monday, 20 August 2001 16:22:05 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:57 GMT