- From: Joseph A Holsten <joseph@josephholsten.com>
- Date: Mon, 21 Sep 2009 09:04:29 -0400
- To: uri-review@ietf.org
URI people:

I intend to replace the current about: scheme Encoding Considerations[1]:

    Because many characters are not permitted with this syntax, the
    "segment" and "query" elements may contain characters from the
    Unicode Character Set [UCS] as suggested by URI [RFC3986], by first
    encoding those characters as octets to the UTF-8 character encoding
    [RFC3629]; then only those octets that do not correspond to
    characters in the unreserved set should be percent-encoded. By using
    UTF-8 encoding, there are no known compatibility issues with mapping
    Internationalized Resource Identifiers to about URIs according to
    [RFC3987]. Since about URIs do not use domain names, "ireg-name"
    conversion is unnecessary.

with the following (adapted from Hixie's ws: scheme[2]):

    Characters in the "segment" or "query" parts that are excluded by
    the syntax defined above must be converted from Unicode to ASCII by
    first encoding the characters as UTF-8 and then replacing the
    corresponding bytes using their percent-encoded form as defined in
    the URI and IRI specifications. [RFC3986] [RFC3987]

Any objections or issues?

1: http://tools.ietf.org/html/draft-holsten-about-uri-scheme-02#section-4
2: http://tools.ietf.org/html/draft-hixie-thewebsocketprotocol-41#section-8.1

--
Joseph Holsten
http://josephholsten.com
mailto:joseph@josephholsten.com
tel:+1-918-948-6747
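
For illustration only, here is a minimal Python sketch of the conversion both wordings describe: encode to UTF-8, then percent-encode the resulting bytes. The draft's actual "segment" and "query" grammar is not reproduced in this message, so the sketch assumes that only the RFC 3986 unreserved characters may appear literally. The function name encode_about_part is hypothetical and not part of either draft.

```python
import string

# Assumption: only RFC 3986 "unreserved" characters are left as-is.
# The real about: grammar may permit more characters in a segment or query.
UNRESERVED = set(string.ascii_letters + string.digits + "-._~")


def encode_about_part(text: str) -> str:
    """Encode text as UTF-8, then percent-encode every byte that is
    not an unreserved ASCII character (hypothetical helper)."""
    out = []
    for byte in text.encode("utf-8"):
        ch = chr(byte)
        if ch in UNRESERVED:
            out.append(ch)
        else:
            # Two uppercase hex digits per byte, as RFC 3986 recommends.
            out.append(f"%{byte:02X}")
    return "".join(out)


print(encode_about_part("café"))
```

A non-ASCII character such as "é" becomes two UTF-8 bytes and therefore two percent-encoded triplets, while plain ASCII letters and digits pass through unchanged.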
Received on Monday, 21 September 2009 13:04:39 UTC