- From: Charles Lindsey <chl@clerew.man.ac.uk>
- Date: Wed, 22 Sep 2004 14:16:45 +0100
- To: uri@w3.org
> 2. Scheme Definition
>
> The news and nntp URL schemes are used to refer to either news groups
> or individual articles of USENET news, as specified in RFC 1036.
>
> The news URL takes the form:
>
> newsURL = "news" ":" [ news-server ]
> ( newsgroup-name | '*' | message-id )
> news-server = "//" server "/"
> message-id = id-left "@" id-right
OK, that syntax is correct now, but you need a normative reference to RFC
2822 for <id-left> and <id-right> (or maybe to the definitions in the
Usefor draft, if that manages to become an RFC by the time this draft is
ready - but don't hold your breath). Presumably also to RFC 2396bis for
<server>, and it still is not clear to me whether <server> could include
user+password information, and if so what one does if the authentication
required by the server is SASL based, which will soon become the norm.
>
> A <newsgroup-name> is a period-delimited hierarchical name, such as
> "comp.lang.perl.modules". A <message-id> corresponds to the
> Message-ID of section 2.1.5 of RFC 1036 [RFC1036], without the
> enclosing "<" and ">"; it takes the form <unique>@<full_domain_name>.
> <unique> cannot be quoted text or have escaping characters.
No, it is not as simple as that, because you can also have an IP address
after the '@' and this is covered, in RFC 2822, under the guise of a
<no-fold-literal> and that brings in quoted text and escaping characters
again.
So I think what you have to say is something like:
"The <id-left> and the <id-right> MUST be in a canonical form in which no
<quoted-string> or <quoted-pair> is used in a context where the same
semantic meaning could have been rendered without such quoting;
moreover, no whitespace may be included, whether %-encoded or not and/or
quoted or not.
For example, neither
news:"abcd"@example.com
nor
"ab\cd"@example.com
is in canonical form, because the form
abcd@example.com
is available."
Yes, there are indeed email systems around that will happily treat email
message identifiers using those three forms as being identical, which is
of course a complete no-no in News.
>
> If <newsgroup-name> is "*" (as in <URL:news:*>), it is used to refer
> to "all available news groups".
OK, that feature has been available since RFC 1738, but I am not at all
sure what it is meant to DO. I just tried it on my browser, and it was
totally confused, telling me "411 Invalid group name (not in active).". I
would be quite happy to see it simply dropped (unless someone can point me
to a system that does something useful with it).
Next, we really need some text to explain what resource is supposed to be
retrieved by this URL. Something like:
"The resource retrieved by this URL is the Netnews article with the
given <message-id>. In a properly working Netnews system, the same
article will be obtained whatever server is accessed for the purpose
(assuming the server in question carried that article in the first
place and that it has not expired). If no <server> is specified, the
article is to be retrieved from whatever server has been configured
for local use."
One may then need wording as to whether this is truly a global resource,
such as people have been discussing regarding the file scheme. I am
keeping out of that one myself - the point may become more relevant if we
bring back the nntp scheme. I notice that RFC 1738 contained the paragraph:
"The news URLs are unusual in that by themselves, they do not contain
sufficient information to locate a single resource, but, rather, are
location-independent."
I am far from clear what that actually means (if anything), but maybe it
is related to what I was trying to say in my suggested paragraph above.
AFAICS, they do indeed locate a single resource (if you count different
copies of the same article as "single").
>
> The nntp URL defined in RFC 1738 is deprectated.
No, I don't think we ever agreed that, and a couple of people have pointed
out places where it is implemented. I have also seen it as the recommended
method, in Opera, to force the system to reload an article from the server
if the client has lost it somehow.
So I would be in favour of bringing it back. I might even be persuaded to
combine it with the news scheme as originally proposed, but only if we
establish first exactly what it is meant to do on its own.
--
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131 Fax: +44 161 436 6133 Web: http://www.cs.man.ac.uk/~chl
Email: chl@clerew.man.ac.uk Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9 Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5
Received on Wednesday, 22 September 2004 18:06:43 UTC