- From: Frank Ellermann <nobody@xyzzy.claranet.de>
- Date: Mon, 25 Jul 2005 06:12:29 +0200
- To: uri@w3.org
Hi, I've found two or three potential problems in RfC 3986 appendix D.2, or I simply don't get the idea (still hunting those non-uric characters known as "unsafe" in RfC 1738, or as <delims> and <unwise> in RfC 2396). 1 - uric, with appendix D.2 I get: 1738 XCHAR: ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _ 2396 URIC : ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _ ~ 2396 URIC : ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _ ~ 3986 URIC_D2: ALNUM $ % & + , - . / : ; = ? @ _ ~ 3986 D.2 doesn't add <reserved> like 1738 and 2396. If I try to fix it by adding <reserved> also in 3986 the result is more plausible: 2396 URIC : ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _ ~ 3986 URIC3: ALNUM ! # $ % & ' ( ) * + , - . / : ; = ? @ [ ] _ ~ 1 - mark, with appendix D.2 I get: 1738 UNRESERVED1: ALNUM ! $ ' ( ) * + , - . _ 2396 UNRESERVED2: ALNUM ! ' ( ) * - . _ ~ 1738 SAFE_EXTRA: ! $ ' ( ) * + , - . _ 2396 MARK : ! ' ( ) * - . _ ~ In other words <mark> is the same as <unreserved> excluding <alphanum>. 2396 UNRESERVED2: ALNUM ! ' ( ) * - . _ ~ 3986 UNRESERVED3: ALNUM - . _ ~ 2396 MARK : ! ' ( ) * - . _ ~ 3986 MARK3: ! ' ( ) * - . _ ~ In 3986 D.2 it's the same old <mark>, no proper subset of <unreserved>. IMHO it should be only "-", ".", "_", "~". 3 - nouric, determined indirectly as all VCHAR excl. the (fixed) <uric>: 1738 UNSAFE : " # % < > [ \ ] ^ ` { | } ~ 2396 DELIM_UNWISE: " # % < > [ \ ] ^ ` { | } 3986 NOURIC3 : " < > \ ^ ` { | } Is that correct ? Is it an omission in appendix D.2 ? Something like: | delims | <"> / "<" / ">" | | unwise | "\" / "^" / "`" / "{" / "|" / "}" | Apparently (?) the complete set of excluded ASCII characters would be: ugly = CTL / SP / DQUOTE / "<" / ">" / "\" / "^" / "`" / "{" / "|" / "}" Bye, Frank
Received on Monday, 25 July 2005 04:15:42 UTC