RE: Towards a request to I18N Core wrt LEIRIs

So is LEIRI pronounced "leery" as in "be leery of using LEIRIs"?

p.

> -----Original Message-----
> From: public-xml-core-wg-request@w3.org 
> [mailto:public-xml-core-wg-request@w3.org] On Behalf Of Henry 
> S. Thompson
> Sent: Tuesday, 2007 August 21 9:26
> To: public-xml-core-wg
> Subject: Towards a request to I18N Core wrt LEIRIs
> 
> Futher to 
> 
>   ACTION to Henry: Explore, with the expectation of proposing, the
>   possibility of asking I18N Core to define "legacy extended IRIs" (by
>   whatever name) in the upcoming revision of the IRI RFC.
> 
> I have discussed things, and this looks like it's worth a try.
> 
> So, I suggest we fill in and send something along the following lines
> to the I18N Core WG, but note I do _not_ understand all the subtleties
> of the issues involved, so those who do please review/revise this
> rigourously.
> 
> - ----------
> 
>   We would like to suggest that the best way to move forward with our
>   effort to reconcile the differences between the way in which various
>   specifications in the XML family allow a superset of IRIs, and the
>   IRI spec. itself, would be to incorporate a new section in the
>   revision of the IRI spec. that you are currently working on, which
>   would name and define a single concept to be referenced from all
>   those XML specs, along the following lines:
> 
>   Name (negotiable): Legacy Extended IRIs (LEIRIs)
>   Definition (taken from [1]):
> 
>    A Human Readable Resource Identifier (HRRI) is a sequence of
>    Unicode characters that can be converted into an IRI by the
>    application of a few simple encoding rules.
> 
>    To convert a Human Readable Resource Identifier to an IRI
>    reference, the following characters MUST be percent encoded:
> 
>     * the control characters #x0 to #x1F and #x7F to #x9F
>     * space #x20
>     * the delimiters "<" #x3C, ">" #x3E, and '"' #x22
>     * the unwise characters "{" #x7B, "}" #x7D, "|" #x7C, "\" #x5C,
>       "^" #x5E, and "`" #x60
>     * characters in the Unicode private use area (#xE000-#xF8FF),
>       except where they appear in the query part of the resulting IRI.
> 
>    These characters are percent encoded by applying [steps 2.1 to 2.3
>    of Section 3.1 of RFC 3987] to them.
> 
>   Health Warning: We would be happy to see some text added to warn
>    against creating new LEIRIs using most or indeed almost all of the
>    characters allowed by this, perhaps expanding on what is already
>    present in [1]: "[A]uthors of HRRIs are advised to percent encode
>    space characters themselves, rather than rely on the processor to
>    do so, because spaces are often used to separate HRRIs in a
>    sequence."
> 
>   We would expect to go ahead and publish several specs. which are
>   waiting for a resolution of this issue, e.g. XML Base 2e and XLink
>   1.1, once there is a stable and agreed-final Internet Draft of a new
>   edition of 3987 including agreed prose along the lines given above,
>   leaving the insertion of the final RFC number to subsequent errata.
> 
> - -------------
> ht
> 
> [1] http://www.w3.org/XML/2007/04/hrri/draft-walsh-tobin-hrri-01c.html

Received on Tuesday, 21 August 2007 14:36:40 UTC