- From: Adil Allawi <adil@diwan.com>
- Date: Mon, 06 Jun 2011 22:54:06 +0100
- To: public-iri@w3.org
- Message-ID: <4DED4C7E.1090706@diwan.com>
I have been working on summarizing this - I will try to post something tomorrow. Adil On 06/06/2011 13:53, Larry Masinter wrote: > > Could someone summarize the requirements for BIDI representation and > display, and the design choices we’re facing and how they match up > against the requirements? > > It seems to me that we’re in the unfortunate situation that the > “desirable” handling of IRIs for BIDI identifiers cannot actually be > accomplished with the technology at hand, and that we’re going to have > to wind up with the unfortunate but unavoidable situation where we > have to make some compromises to get something that will work at all. > > These lengthy discussions about “desirable” handling of BIDI URIs > don’t help much if we’re not actually evaluating technological solutions. > > It may be that “side of bus” printing for BIDI IRIs are limited, for > example, or that we might need to establish some other additional > typographical conventions for side-of-bus display of BIDI IRIs. > > The technology we have at hand is pretty weak > > – can non-visible direction characters be part of the IRI? > > -Can we, should we, advise those who are implementing novel IRI > display mechanisms (“show IRI in address bar”) and IRI entry > mechanisms (“type IRI in the address bar”) to do something different > from the ordinary “give IRI string to ordinary unicode string display > mechanism”. > > Larry > > -- > > http://larry.masinter.net > > *From:*public-iri-request@w3.org [mailto:public-iri-request@w3.org] > *On Behalf Of *Shawn Steele > *Sent:* Sunday, June 05, 2011 12:20 PM > *To:* Aharon (Vladimir) Lanin; Matitiahu Allouche > *Cc:* bidi@unicode.org; bidi-bounce@unicode.org; Mark Davis ☕; Mohamed > Mohie; public-iri@w3.org; public-iri-request@w3.org > *Subject:* RE: [bidi] BIDI? > > I think that the "side of a bus" case often skips the http:// part, so > it does matter a bit. > > FWIW: I don't think we have to worry about this case so much: > > http://worldblog.msnbc.msn.com/_news/2011/06/05/6789539-amid-the-ruins-a-fisherman-contemplates-a-daunting-future > > but rather simpler cases like "msnbc.com", "biz.host.com", or > "host.com/biz", since that's what's on the side of a bus. And, of > course, the email variations. > > I don't know if that "simplifies" the problem any, but a few RTL > characters deep in an obscure file path in an otherwise LTR string > probably aren't very interesting. > > Also, ancedotal evidence suggests that the "average" user may not be > aware that www.msnbc.com <http://www.msnbc.com> means "the www server > at msnbc, which registered with .com". It can be misinterpreted as > "msnbc's part of the web (www)", eg, msnbc somehow registered with > www. So I don't think we can ensure that LTR or RTL ordering > preserves some sort of security heirarchy, at least for the average user. > > I think the key point is "how do we get someone to write it down and > key it in later without any mistakes"? > > -Shawn > > > > http://blogs.msdn.com/shawnste > > ------------------------------------------------------------------------ > > *From:*Aharon (Vladimir) Lanin [aharon@google.com] > *Sent:* Sunday, June 05, 2011 11:07 AM > *To:* Matitiahu Allouche > *Cc:* bidi@unicode.org; bidi-bounce@unicode.org; Mark Davis ☕; Mohamed > Mohie; public-iri@w3.org; public-iri-request@w3.org; Shawn Steele > *Subject:* Re: [bidi] BIDI? > > You have a point, although for http://MY.DOMAIN.org and > http://org.DOMAIN.MY, the results would be different: > > org.NIAMOD.YM//:http and http://org.NIAMOD.YM, respectively. > > Aharon > > On Sun, Jun 5, 2011 at 7:17 PM, Matitiahu Allouche <matial@il.ibm.com > <mailto:matial@il.ibm.com>> wrote: > > Aharon (Vladimir) Lanin wrote: "To my taste, first strong in the > domain name is best". > First strong in the domain name fails the napkin test. If the logical > name is (upper case = RTL): > MY.DOMAIN.org <http://MY.DOMAIN.org> > it would be displayed > org.NIAMOD.YM > > Such a display could come from the logical name "MY.DOMAIN.org > <http://MY.DOMAIN.org>", but also from "org.MY.DOMAIN", thus it is not > unambiguous. > > > Shalom (Regards), Mati > > > > From: "Aharon (Vladimir) Lanin" <aharon@google.com > <mailto:aharon@google.com>> > > To: Matitiahu Allouche/Israel/IBM@IBMIL > > Cc: Shawn Steele <Shawn.Steele@microsoft.com > <mailto:Shawn.Steele@microsoft.com>>, bidi@unicode.org > <mailto:bidi@unicode.org>, bidi-bounce@unicode.org > <mailto:bidi-bounce@unicode.org>, "public-iri@w3.org > <mailto:public-iri@w3.org>" <public-iri@w3.org > <mailto:public-iri@w3.org>>, Mohamed Mohie <MOHIEM@eg.ibm.com > <mailto:MOHIEM@eg.ibm.com>>, public-iri-request@w3.org > <mailto:public-iri-request@w3.org>, Mark Davis ☕ <mark@macchiato.com > <mailto:mark@macchiato.com>> > Date: 05/06/2011 18:43 > > Subject: Re: [bidi] Re: BIDI? > > ------------------------------------------------------------------------ > > > > > I think that there needs to be a secondary objective: to get all-rtl > iris displayed rtl overall, not in a constant back-and-forth at every > separator. Like Mohammed, I think that this should be based on the > presence of rtl in the domain name. To my taste, first strong in the > domain name is best, but I think that the exact algorithm to use (on > the domain name) is less important. > > Aharon > > On Jun 5, 2011 10:27 AM, "Matitiahu Allouche" <matial@il.ibm.com > <mailto:matial@il.ibm.com>> wrote: > > Please define "mostly Latin" and "mostly Arabic or Hebrew". > > > > Are you suggesting to count LTR and RTL characters? Are they all equally > > weighted? > > Does the counting include the scheme (e.g. "http")? the TLD? > > > > Please consider that the prime objective, IMHO, is to enable easy and > > unambiguous human translation from a displayed IRI (napkin, bus side) to > > the corresponding logical string. > > > > Shalom (Regards), Mati > > Bidi Architect > > Globalization Center Of Competency - Bidirectional Scripts > > IBM Israel > > Fax: +972 2 5870333 <tel:%2B972%202%205870333> Mobile: +972 52 > 2554160 <tel:%2B972%2052%202554160> > > > > > > > > > > From: Mohamed Mohie <MOHIEM@eg.ibm.com <mailto:MOHIEM@eg.ibm.com>> > > To: Matitiahu Allouche/Israel/IBM@IBMIL > > Cc: bidi@unicode.org <mailto:bidi@unicode.org>, > bidi-bounce@unicode.org <mailto:bidi-bounce@unicode.org>, Mark Davis ☕ > > <mark@macchiato.com <mailto:mark@macchiato.com>>, "public-iri@w3.org > <mailto:public-iri@w3.org>" <public-iri@w3.org > <mailto:public-iri@w3.org>>, Shawn > > Steele <Shawn.Steele@microsoft.com <mailto:Shawn.Steele@microsoft.com>> > > Date: 03/06/2011 22:06 > > Subject: Re: [bidi] Re: BIDI? > > Sent by: public-iri-request@w3.org <mailto:public-iri-request@w3.org> > > > > > > > > Hello Mati, > > To overcome the problem you highlighted below I have a suggestion to be > > added for the URL design which is to set the embedding level according to > > the directionality of the domain name. > > 1- If the domain name "MY.OWN.DOMAIN" is mostly Latin set the embedding > > level to even. > > 2- If the domain name "MY.OWN.DOMAIN" is mostly Arabic or Hebrew set the > > embedding level to odd. > > > > Thanks And Best regards, > > Mohamed Mohie , PMP® > > ________________________________________________ > > GCoC BIDI , > > Advisory Software Engineer, Project Manager, M.Sc. > > Cairo Technology Development Center (CTDC) > > IBM Egypt > > email : mohiem@eg.ibm.com <mailto:mohiem@eg.ibm.com> > > > > > > > > > > > > From: Matitiahu Allouche <matial@il.ibm.com <mailto:matial@il.ibm.com>> > > To: Mark Davis ☕ <mark@macchiato.com <mailto:mark@macchiato.com>> > > Cc: bidi@unicode.org <mailto:bidi@unicode.org>, > bidi-bounce@unicode.org <mailto:bidi-bounce@unicode.org>, > "public-iri@w3.org <mailto:public-iri@w3.org>" > > <public-iri@w3.org <mailto:public-iri@w3.org>>, Shawn Steele > <Shawn.Steele@microsoft.com <mailto:Shawn.Steele@microsoft.com>> > > Date: 27/04/2011 10:38 ص > > Subject: [bidi] Re: BIDI? > > Sent by: bidi-bounce@unicode.org <mailto:bidi-bounce@unicode.org> > > > > > > > > Hello, Mark! > > > > I am glad to see somebody daring to tackle this issue. > > > > You wrote: <quote> > > If a bidiIri is recognized, then it is handled by the UBA as if each > > separator is surrounded by: > > LRM (if the embedding level is even) or > > RLM (if the embedding level is odd) > > <end of quote> > > > > This design has the following consequences, which IMHO are not optimal: > > a) The same URL (IRI) will be displayed differently according to the > > embedding level. This is confusing. > > b) Pure Latin-character URLs will be displayed in a new and strange way > > when the embedding level is odd. For instance, > "htttp://docs.google.com <http://docs.google.com/>" > > will be displayed as "com.google.docs//:http". > > > > Consequently, I second Slim Amamou's suggestion to "have a > > predefined/enforced directionality in the specs for each scheme? (ex. LTR > > for URLs)". > > It is true that pure or mostly Hebrew or Arabic URLs will be > displayed in > > a > > way which may seem strange. For instance, "http://MY.OWN.DOMAIN.com > <http://my.own.domain.com/>" > > (where > > upper case letters represent RTL letters) will be displayed as " > > http://YM.NWO.NIAMOD.com <http://ym.nwo.niamod.com/>", but > > 1. The scheme and the TLD currently are pure LTR, and I guess that > this is > > not going to change soon, so the display of mixed LTR/RTL URLs will be > > strange anyway. > > 2. The use of domain names with RTL labels is still scarce, there is no > > common usage to overcome, so the public will get accustomed to the > > "strange" display right from the beginning. > > > > > > Shalom (Regards), Mati > > Bidi Architect > > Globalization Center Of Competency - Bidirectional Scripts > > IBM Israel > > Fax: +972 2 5870333 <tel:%2B972%202%205870333> Mobile: +972 52 > 2554160 <tel:%2B972%2052%202554160> > > > > > > > > > > From: Mark Davis ☕ <mark@macchiato.com <mailto:mark@macchiato.com>> > > To: Shawn Steele <Shawn.Steele@microsoft.com > <mailto:Shawn.Steele@microsoft.com>> > > Cc: "public-iri@w3.org <mailto:public-iri@w3.org>" <public-iri@w3.org > <mailto:public-iri@w3.org>>, bidi@unicode.org <mailto:bidi@unicode.org> > > Date: 27/04/2011 02:24 > > Subject: [bidi] Re: BIDI? > > Sent by: bidi-bounce@unicode.org <mailto:bidi-bounce@unicode.org> > > > > > > > > Here are some rough thoughts on how we could handle bidi IRIs. > > > > http://goo.gl/QwSoo > > > > Feedback is welcome. > > > > Mark > > > > On Wed, Apr 20, 2011 at 23:20, Shawn Steele > <Shawn.Steele@microsoft.com <mailto:Shawn.Steele@microsoft.com>> > > wrote: > > I'm wondering what the current thinking around BIDI IRIs is? A few things > > in draft-ietf-iri-3987bis-05 jump out at me. > > > > > > -Shawn> > -- -------------------------------------------- Adil Allawi Diwan Software Ltd. 37-39 Peckham Road, London SE5 8UH, U.K. Mobile: +44 (0)7718 785 850 mailto:adil@diwan.com http://www.diwan.com/
Received on Monday, 6 June 2011 21:54:31 UTC