- From: Grosso, Paul <pgrosso@ptc.com>
- Date: Tue, 21 Aug 2007 10:34:46 -0400
- To: "public-xml-core-wg" <public-xml-core-wg@w3.org>
So is LEIRI pronounced "leery" as in "be leery of using LEIRIs"? p. > -----Original Message----- > From: public-xml-core-wg-request@w3.org > [mailto:public-xml-core-wg-request@w3.org] On Behalf Of Henry > S. Thompson > Sent: Tuesday, 2007 August 21 9:26 > To: public-xml-core-wg > Subject: Towards a request to I18N Core wrt LEIRIs > > Futher to > > ACTION to Henry: Explore, with the expectation of proposing, the > possibility of asking I18N Core to define "legacy extended IRIs" (by > whatever name) in the upcoming revision of the IRI RFC. > > I have discussed things, and this looks like it's worth a try. > > So, I suggest we fill in and send something along the following lines > to the I18N Core WG, but note I do _not_ understand all the subtleties > of the issues involved, so those who do please review/revise this > rigourously. > > - ---------- > > We would like to suggest that the best way to move forward with our > effort to reconcile the differences between the way in which various > specifications in the XML family allow a superset of IRIs, and the > IRI spec. itself, would be to incorporate a new section in the > revision of the IRI spec. that you are currently working on, which > would name and define a single concept to be referenced from all > those XML specs, along the following lines: > > Name (negotiable): Legacy Extended IRIs (LEIRIs) > Definition (taken from [1]): > > A Human Readable Resource Identifier (HRRI) is a sequence of > Unicode characters that can be converted into an IRI by the > application of a few simple encoding rules. > > To convert a Human Readable Resource Identifier to an IRI > reference, the following characters MUST be percent encoded: > > * the control characters #x0 to #x1F and #x7F to #x9F > * space #x20 > * the delimiters "<" #x3C, ">" #x3E, and '"' #x22 > * the unwise characters "{" #x7B, "}" #x7D, "|" #x7C, "\" #x5C, > "^" #x5E, and "`" #x60 > * characters in the Unicode private use area (#xE000-#xF8FF), > except where they appear in the query part of the resulting IRI. > > These characters are percent encoded by applying [steps 2.1 to 2.3 > of Section 3.1 of RFC 3987] to them. > > Health Warning: We would be happy to see some text added to warn > against creating new LEIRIs using most or indeed almost all of the > characters allowed by this, perhaps expanding on what is already > present in [1]: "[A]uthors of HRRIs are advised to percent encode > space characters themselves, rather than rely on the processor to > do so, because spaces are often used to separate HRRIs in a > sequence." > > We would expect to go ahead and publish several specs. which are > waiting for a resolution of this issue, e.g. XML Base 2e and XLink > 1.1, once there is a stable and agreed-final Internet Draft of a new > edition of 3987 including agreed prose along the lines given above, > leaving the insertion of the final RFC number to subsequent errata. > > - ------------- > ht > > [1] http://www.w3.org/XML/2007/04/hrri/draft-walsh-tobin-hrri-01c.html
Received on Tuesday, 21 August 2007 14:36:40 UTC