W3C home > Mailing lists > Public > www-international@w3.org > July to September 2005

Re: bidi discussion list was: Bidi Markup vs Unicode control characters

From: Mark Davis <mark.davis@icu-project.org>
Date: Tue, 16 Aug 2005 09:26:57 -0700
Message-ID: <029401c5a27f$52cc6380$0e783009@sanjose.ibm.com>
To: "Stephen Deach" <sdeach@adobe.com>, "Jony Rosenne" <rosennej@qsm.co.il>
Cc: <www-international@w3.org>

> Martin indicated he had a list of language-to-script correlations. I'ld
> like to see it.

We have been gathering data on language->script, territory information in
CLDR. See
http://unicode.org/cldr/data/common/supplemental/supplementalData.xml for
what we currently have.

There is an HTML version at the following location, which is easier to read.
http://www.unicode.org/cldr/data/diff/supplemental/supplemental.html


‚ÄéMark

----- Original Message ----- 
From: "Stephen Deach" <sdeach@adobe.com>
To: "Jony Rosenne" <rosennej@qsm.co.il>; "'Stephen Deach'"
<sdeach@adobe.com>
Cc: <www-international@w3.org>
Sent: Monday, August 15, 2005 09:05
Subject: RE: bidi discussion list was: Bidi Markup vs Unicode control
characters


>
> Yes, they are scripts (I said so).
>
> Martin indicated he had a list of language-to-script correlations. I'ld
> like to see it.
>
> At 2005.08.15-18:45(+0200), Jony Rosenne wrote:
> >These are scripts, not languages.
> >
> >The Hebrew script, for instance, is routinely used for at least three
> >languages and rarely used for many more.
> >
> >The Arabic script is used for a number of languages today, and
historically
> >for many more.
> >
> >Jony
> >
> > > -----Original Message-----
> > > From: www-international-request@w3.org
> > > [mailto:www-international-request@w3.org] On Behalf Of John Cowan
> > > Sent: Monday, August 15, 2005 5:44 PM
> > > To: Stephen Deach
> > > Cc: Martin Duerst; Ognyan Kulev; Tex Texin; Addison Phillips;
> > > www-international@w3.org; Richard Ishida; Bert Bos
> > > Subject: Re: bidi discussion list was: Bidi Markup vs Unicode
> > > control characters
> > >
> > >
> > >
> > > Stephen Deach scripsit:
> > >
> > > > The only scripts identified as RTL in Unicode are Arabic
> > > and Hebrew.
> > >
> > > In fact, Syriac and Tifinagh are already encoded in Unicode,
> > > as well as the
> > > archaic scripts Cypriot and Kharoshthi.  Phoenician has been
> > > fully blessed
> > > and will be in the next version.
> > >
> > > Still in the long tail are Old Hungarian (aka rovasiras),
> > > Avestan, Mandaic,
> > > Samaritan, Manichaean, and perhaps others.
> > >
> > > --
> > > Newbies always ask:                             John Cowan
> > >   "Elements or attributes?
> > > http://www.ccil.org/~cowan
> > > Which will serve me best?"
> > > http://www.reutershealth.com
> > >   Those who know roar like lions;               cowan@ccil.org
> > >   Wise hackers smile like tigers.                   --a
> > > tanka, or extended haiku
> > >
> > >
> > >
> > >
>
>
> ---Steve Deach
>     sdeach@adobe.com
>
>
>
>
Received on Tuesday, 16 August 2005 16:27:10 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:05 GMT