- From: Frank Ellermann <nobody@xyzzy.claranet.de>
- Date: Fri, 14 Oct 2005 23:30:22 +0200
- To: uri@w3.org
Roy T. Fielding wrote:
>> can I now submit the RfC 3986 appendix D.2> problems as erratum ?
> No, it isn't an error.
IBTD, 2396 <uric> and its predecessor 1738 <xchar> used to be
"any ASCII you might find in an URL". Excl. '#', because the
fragments were not considered to be a part of the URL.
But 3986 D2 removed ! ' ( ) * from its new <uric> set instead
of adding # [ ].
Similar 2396 <mark> and its predecessor 1738 <safe> + <extra>
used to be the same as <unreserved> minus ALPHA + DIGIT.
3986 D2 kept <mark> as is instead of moving ! ' ( ) * to <uric>
> You are not supposed to use the terms in appendix D for new
> specifications.
That's clear, they are used to interpret pre-3986 URI-schemes.
> intentionally conservative to avoid the creation of bad URIs.
But it's not more okay to use ! ' ( ) * freely in URLs. And an
old rule "%-encode everything that's no <uric>" would miss the
special case IPv6address.
> Why don't you simply use the current rules in the body of
> 3986?
Now that's exactly what I want, but there's no list of the new
non-<uric> VCHARs, that's the old 2396 <delims> + <unwise> and
its predecessor 1738 <unsafe>.
> Just ignore appendix D for new specifications.
That doesn't help to port old specifications, especially some
missing schemes of 1738. It's also relevant for 2368bis, the
last draft still used <uric>, or I confuse it with another I-D.
If I got it right then the following nine VCHARs in addition to
'%' always have to be %-encoded under 3986: " < > \ ^ ` { | }
That's not mentioned anymore in 3986. It requires a script for
set operations on VCHAR to determine the nine "ugly" characters,
formerly known as <delims> + <unwise> or <unsafe>.
Bye, Frank
Received on Friday, 14 October 2005 21:42:13 UTC