- From: Etienne Kroger <etienne.kroger@welocalize.com>
- Date: Wed, 15 Nov 2000 10:33:44 -0500
- To: <www-international@w3.org>
Rajat, As illustrated by our colleagues, there is not a one-to-one relationship between encodings and languages. Encoding schemes based on the Unicode character set or ISO-8859 span many languages. Likewise, languages can be encoded in many ways depending on the underlying technologies (e.g. Japanese or Korean listed below). This is what makes this so interesting. :) Etienne -----Original Message----- From: www-international-request@w3.org [mailto:www-international-request@w3.org]On Behalf Of Patrick Jenny Sent: Wednesday, November 15, 2000 9:36 AM To: www-international@w3.org Subject: RE: Language questions... Of course, they are encoded in Unicode as well. -----Original Message----- From: Shigemichi Yazawa [mailto:yazawa@globalsight.com] Sent: Tuesday, November 14, 2000 6:23 PM To: www-international@w3.org Subject: Re: Language questions... At Wed, 15 Nov 2000 09:02:57 +0900, Bhatnagar, Rajat <Rajat.Bhatnagar@BroadVision.com> wrote: > 1. The character set for the above languages ? I guess what you want to know is the character encodings. Is this right? Although I'm sure there are many encoding schemes for each language, I'll list what I know. > English > French > Danish > Finish > German > Italian > Dutch > Norwegian > Portuguese > Swedish > Spanish ISO-8859-1 > Polish ISO-8859-2 > Greek ISO-8859-7 > Hebrew ISO-8859-8 > Turkish ISO-8859-9 > Urdu (Indian) don't know... ISO-8859-6 (Arabic) lacks some characters needed for Urdu. > Chinese (Simplified) GBK EUC_CN ISO-2022-CN > Chinese (Traditional) Big5 EUC_TW ISO-2022-CN > Japanese Shift_JIS EUC_JP ISO-2022-JP > Korean Johab EUC_KR ISO-2022-KR > 2. Of the above languages which one writes from right to left ? Hebrew and Urdu. ------------------- Shigemichi Yazawa yazawa@globalsight.com
Received on Wednesday, 15 November 2000 10:28:29 UTC