Re: "Difficult Characters" draft

Martin J. Duerst (mduerst@ifi.unizh.ch)
Sun, 4 May 1997 17:36:18 +0200 (MET DST)


Date: Sun, 4 May 1997 17:36:18 +0200 (MET DST)
From: "Martin J. Duerst" <mduerst@ifi.unizh.ch>
To: Larry Masinter <masinter@parc.xerox.com>
cc: URI mailing list <uri@bunyip.com>
Subject: Re: "Difficult Characters" draft
In-Reply-To: <336A3609.668A@parc.xerox.com>
Message-ID: <Pine.SUN.3.96.970504172656.245p-100000@enoshima>

On Fri, 2 May 1997, Larry Masinter wrote:

> > Other issues:
> > The bidi issues for RLT languages in conjunction with
> > normal punctuation used in and around identifiers. (Will
> > the identifiers present themselves 'correctly' without
> > these characters in all cases?)
> 
> When you type Hebrew and enter:
> 
>    "http://host.il/WERBEH/DROW"
> 
> (HEBREW WORD), might some typing software add direction markers
> and some other typing software leave it out?

When the above sequence is typed (in logical/phonetical order
as HEBREW/WORD) in a *plain text context* and with a display
engine using the Unicode BIDI algorithm, it will not appear
as desired, i.e. as WERBEH/DROW. The same applies to a lot of
other kinds of formal syntax, in particular also to HTML/SGML
syntax.

Sadly enough, there doesn't seem anything much that can be done
to change this.

The aim of the upcomming draft for BIDI for URLs (or identifiers)
is to define their display (and input) behaviour in places where
these identifiers are handled as such, i.e. the input field at
the top of a browser page, the file list display in finder-like
places, and URL input/display in structured HTML editors,...

Regards,	Martin.