- From: Joseph A Holsten <joseph@josephholsten.com>
- Date: Mon, 21 Sep 2009 09:04:29 -0400
- To: uri-review@ietf.org
URI people:
I intend to replace the current about: scheme Encoding
Considerations[1]:
Because many characters are not permitted with this syntax, the
"segment" and "query" elements may contain characters from the
Unicode Character Set [UCS] as suggested by URI [RFC3986], by first
encoding those characters as octets to the UTF-8 character encoding
[RFC3629]; then only those octets that do not correspond to
characters in the unreserved set should be percent-encoded.
By using UTF-8 encoding, there are no known compatibility issues with
mapping Internationalized Resource Identifiers to about URIs according
to [RFC3987]. Since about URIs do not use domain names, "ireg-name"
conversion is unnecessary.
with the following (adapted from hixie's ws: scheme[2]):
Characters in the "segment" or "query" parts that are excluded by the
syntax defined above must be converted from Unicode to ASCII by first
encoding the characters as UTF-8 and then replacing the corresponding
bytes using their percent-encoded form as defined in the URI and IRI
specifications. [RFC3986] [RFC3987]
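
To illustrate the proposed wording, here is a rough Python sketch of the
two steps (encode as UTF-8, then percent-encode the resulting bytes). It
assumes that the characters allowed unescaped are the RFC 3986
"unreserved" set, since the draft's own syntax is not reproduced in this
message; encode_part is a hypothetical helper name, not something taken
from the draft.

```python
from urllib.parse import quote, unquote


def encode_part(text: str) -> str:
    """Percent-encode a "segment" or "query" value (illustrative only).

    Assumption: only RFC 3986 unreserved characters (ASCII letters,
    digits, "-", ".", "_", "~") may appear unescaped. The real about:
    syntax may permit more characters than this.
    """
    # quote() encodes the string as UTF-8 and percent-encodes every
    # byte that is not an unreserved character. safe="" stops "/" from
    # being left unescaped, which is quote()'s default behaviour.
    return quote(text, safe="", encoding="utf-8")


if __name__ == "__main__":
    for sample in ["blank", "caf\u00e9", "a b", "\u65e5\u672c"]:
        encoded = encode_part(sample)
        # Decoding reverses both steps and should give back the input.
        print(sample, "->", encoded, "->", unquote(encoded))
```

Under that assumption, "café" would become "caf%C3%A9": the single
character U+00E9 turns into the two UTF-8 bytes C3 A9, each of which is
then percent-encoded.
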
Any objections or issues?
1: http://tools.ietf.org/html/draft-holsten-about-uri-scheme-02#section-4
2: http://tools.ietf.org/html/draft-hixie-thewebsocketprotocol-41#section-8.1
--
Joseph Holsten
http://josephholsten.com
mailto:joseph@josephholsten.com
tel:+1-918-948-6747
Received on Monday, 21 September 2009 13:04:39 UTC