"+" chars in links as bad as ampersands?

[not a list member]

To the best of my knowledge--and according to the validator itself--a link
(anchor tag) may have a plus-character in the URL.  However, if the plus-
character is part of the "uri" argument passed to the validator itself, it
gets converted to "%20" (encoded space character), which seems entirely
bogus and inexplicable.  It's *possible* this is a browser bug (NN 4.7 under
Linux), but since the Opera 6.0 beta does exactly the same thing, I would
tend to suspect the validator.

Here's a test page, itself automatically generated:


At the very bottom left is a period linked to the validator; its link is
as follows:


Either following that link or typing it in the browser's address bar results
in the following (from the validator's URI entry field):


Since the requested page (with spaces) doesn't exist, the validator fails.
However, entering the "+"-enabled URL in the validator's URI entry field
directly works just fine, as does modifying the browser's address bar to
use %2b instead of plus-characters.  (Note that either approach results in
the validation of the original, unmodified page, not of a page itself modi-
fied to use %2b's.)

So, question:  _should_ I have to escape my plus-characters when they appear
as an argument to the validator's "check" script (is that a special CGI ex-
ception of some sort), or is the script in error?

Final note:  before we added an explicit character-encoding to the pages,
at least some versions of Internet Exploder would default to UTF-7, which
apparently treats "+[digits]" as some sort of encoded character.  Is that
something the validator might be doing, too (or UTF-8, more likely)?

Greg Roelofs            newt@pobox.com             http://pobox.com/~newt/
Newtware, PNG Group, Info-ZIP, AlphaWorld Map, Philips Semiconductors, ...

Received on Thursday, 21 February 2002 00:40:02 UTC