- From: Larry Masinter <masinter@adobe.com>
- Date: Wed, 1 Apr 2009 20:48:28 -0700
- To: Martin J. Dürst <duerst@it.aoyama.ac.jp>
- CC: Juergen Schoenwaelder <j.schoenwaelder@jacobs-university.de>, "uri-review@ietf.org" <uri-review@ietf.org>, Martin Bjorklund <mbj@tail-f.com>, David Partain <david@partain.se>, "uri@w3.org" <uri@w3.org>
I'm not sure that "necessary" percent encoding is well defined
Sent from my iPhone
On Apr 1, 2009, at 7:22 PM, "Martin J. Dürst" <duerst@it.aoyama.ac.jp>
wrote:
> Hello Juergen,
>
> I think this is a question about generic URI syntax, and therefore
> should go to uri@w3.org (which I have cc'ed).
>
> Did you also consider having a datatype for IRIs?
>
> Regards, Martin.
>
> P.S.: Please remove uri-review@ietf.org from the cc list when
> replying.
>
> On 2009/04/01 18:12, Juergen Schoenwaelder wrote:
>> Hi,
>>
>> I am editor of a document defining among other things a URI data type
>> for the YANG data modeling language (NETMOD working group). Right
>> now,
>> our definition looks as follows (<draft-ietf-netmod-yang-types-02>):
>>
>> typedef uri {
>> type string; // [TODO] add the regex from RFC 3986 here?
>> description
>> "The uri type represents a Uniform Resource Identifier
>> (URI) as defined by STD 66.
>>
>> Objects using the uri type must be in US-ASCII encoding,
>> and MUST be normalized as described by RFC 3986 Sections
>> 6.2.1, 6.2.2.1, and 6.2.2.2. All unnecessary
>> percent-encoding is removed, and all case-insensitive
>> characters are set to lowercase except for hexadecimal
>> digits, which are normalized to uppercase as described in
>> Section 6.2.2.1.
>>
>> The purpose of this normalization is to help provide
>> unique URIs. Note that this normalization is not
>> sufficient to provide uniqueness. Two URIs that are
>> textually distinct after this normalization may still be
>> equivalent.
>>
>> Objects using the uri type may restrict the schemes that
>> they permit. For example, 'data:' and 'urn:' schemes
>> might not be appropriate.
>>
>> A zero-length URI is not a valid URI. This can be used to
>> express 'URI absent' where required
>>
>> This type is in the value set and its semantics equivalent
>> to the Uri textual convention of the SMIv2.";
>> reference
>> "RFC 3986: Uniform Resource Identifier (URI): Generic Syntax
>> RFC 3305: Report from the Joint W3C/IETF URI Planning Interest
>> Group: Uniform Resource Identifiers (URIs), URLs,
>> and Uniform Resource Names (URNs): Clarifications
>> and Recommendations
>> RFC 5017: MIB Textual Conventions for Uniform Resource
>> Identifiers (URIs)";
>> }
>>
>> One particular question is whether it is safe to add the following
>> pattern restriction (XSD regular expression syntax):
>>
>> type string {
>> pattern '(([^:/?#]+):)?(//([^/?#]*))?([^?#]*)(\?([^#]*))?
>> (#(.*))?';
>> }
>>
>> The regular expression is taken from appendix B of RFC 3986.
>>
>> /js
>>
>
> --
> #-# Martin J.Dürst, Professor, Aoyama Gakuin University
> #-# http://www.sw.it.aoyama.ac.jp mailto:duerst@it.aoyama.ac.jp
>
Received on Thursday, 2 April 2009 03:49:32 UTC