- From: Mike Brown <mike@skew.org>
- Date: Sat, 8 Oct 2005 16:08:36 -0600 (MDT)
- To: www-forms-editor@w3.org
Hi all,
I was disappointed to see that my February suggestions for fixing XForms 1.0's
botched revamp of application/x-www-form-urlencoded did not not make it into
the XForms 1.0 2nd ed. Proposed Edited Recommendation. I am wondering what
happened; I never heard anything about it.
In February 2005 [1] I had asked that consideration be given to an erratum
to fix a couple of issues with section 11.6 of the XForms 1.0 spec. There
was a major issue and a minor one.
The minor one, which I don't really care about anymore, was just the
phrasing of the reference to the now-superseded RFC 2396. It's not worth
pursuing; the intent of the current wording is clear enough.
The major one is that it uses the term "non-ASCII and reserved" where it
should use "non-ASCII and non-unreserved".
The only characters that are safe to leave in their raw form in this media
type, or in any URI, are those in the "unreserved" set:
A-Z, a-z, 0-9, ~, -, _, and period.
The "reserved" set is for characters that have (or may, in some cases) have
special meaning. The set, as of RFC 3986, consists of:
!, *, ', (, ), ;, :, @, &, =, +, $, /, ?, %, #, [, ], and comma.
When you say "non-ASCII and reserved" are the ones that need to be converted
to UTF-8 and percent-encoded, it is implicit that the unreserved set is safe
to leave as-is. But, aside from space, LF and CR, you have not accounted for
a number of characters that are in ASCII, but are not in the reserved or
unreserved sets.
This would most easily be addressed by just changing the "reserved" to
"non-unreserved" in the specification. This would restore the intended
behavior to match previous specifications and implementations.
Thanks,
Mike
[1] http://lists.w3.org/Archives/Public/www-forms/2005Feb/0028.html
(please forgive my misspelling of 'superseded' therein)
Received on Saturday, 8 October 2005 22:08:41 UTC