Re: FW: spoofing and IRIs (from Michel Suignard) from Slim Amamou on 2010-03-03 (public-iri@w3.org from March 2010)

From: Slim Amamou <slim@alixsys.com>
Date: Wed, 3 Mar 2010 16:00:44 +0100
To: Larry Masinter <LMM@acm.org>
Cc: public-iri@w3.org
Message-ID: <e5c98a401003030700w5c5abfacy217660ca3bd8ce6e@mail.gmail.com>

IRI is a sequence of Unicode characters that has a special meaning.
Like 2010 or 3,1415. For instance PI, and all real numbers, have a
special separator character which is the comma. And real numbers have
a special internal structure (or syntax) that allow for interpreting
them following a logical mental path. That syntax could be summarized
in two rules :
1- real numbers must be presented visually ordered LTR beginning with
the highest end and ending with the lowest end. Regardless of the
directionality of the overall context
2- the integer part must be on the left side of the comma separator
and the fractional part must be on the right side of the comma.
Regardless of the directionality of the overall context

I think those rules are IRI dependent and are outside the scope of
Unicode. The only way for unicode to handle this kind of rules is to
have an extensibility mechanism which allows for adding new specs for
new notations. Something like what is done at IETF for URI schemes.

On Wed, Mar 3, 2010 at 3:18 PM, Larry Masinter <LMM@acm.org> wrote:
> Here's what I don't understand:
>
> An IRI is a sequence of Unicode characters. Is there not
> already a well-defined way of converting a sequence of
> Unicode characters to a visual display?
>
> So how can there be a separate rule for converting
> an IRI to a visual display which doesn't match the
> Unicode rule?
>
> Larry

-- 
Slim Amamou | سليم عمامو
http://alixsys.com

Received on Wednesday, 3 March 2010 15:01:18 UTC