- From: Arjun Ray <aray@q2.net>
- Date: Fri, 18 Feb 2000 14:40:06 -0500 (EST)
- To: W3C HTML <www-html@w3.org>
On Fri, 18 Feb 2000, Russell Steven Shawn O'Connor wrote: > I'm a little surprised to find myself saying this, but after thinking > about it for a while, I've come to the conclusion that a URL identifier > for XHTML is as good as, and probably slightly better than an FPI > identifier. > > I was pretty convinced that the URL, was just a location, Such and such a > file on a particular machine, retrieved by a particular protocol, whereas we > want an identifier that says, this the the XHTML DTD, which is > independent of protocol, and machine. > > But lets look at the URL more carefully > > http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd > > It has (more or less) 3 parts: ``http'' ``www.w3.org`` and > ``TR/xhtml-basic/xhtml-basic10.dtd'', plus some separators for parsing. > > A more logical order is > > (1)www.w3.org > (2)http: > (3)TR/xhtml-basic/xhtml-basic10.dtd > > because first you connect to a machine, then use a the protocol with the > file. Not quite. You can't just "connect to a machine". A TCP connection involves a host *and* port combination - and a *defaulted* port is associated with a protocol (the 'http:' is what allows you to omit the ':80' after 'www.w3.org'.) > Lets look at each part, starting with (3). TR/xhtml-basic/xhtml-basic10.dtd > is just a name. Yes... in relation to a protocol. > (2) is the protocol. But because URL's are uniform, the meaning of (2) > is irrelevant to us. What does "uniform" mean? Why should http be irrelevant? (What if it's ftp?) > This brings us to (1), the most important part. [...] > www.w3.org isn't a machine, it is a virtual machine name. Yes. DNS at work:) > An important point is that the W3C owns all names in (1) of the form > ...w3.org. Yes... > Now lets look at > > -//W3C//DTD XHTML Basic 1.0//EN > > It has 3 parts too: > (1) -//W3C > (2) DTD XHTML Basic 1.0 > (3) EN > > But (2) + (3) is the same as TR/xhtml-basic/xhtml-basic10.dtd. Yes. > The .dtd says it's a DTD, lots of use of xhtml-basic. We don't know that from '.dtd' (although a Certain Big Company would love it if you fell into the habit of making such assumptions;)) And, btw, the public text class 'DTD' in (2) curiously enough means 'document type declaration subset' (ISO-8879/10.2.2.1) - note that fourth word! > www.w3.org corresponds to -//W3C. Not quite. w3.org (the domain name, rather than the host name) is a better analogue. > Both indicate the the following name > (TR/xhtml-basic/xhtml-basic10.dtd or DTD XHTML Basic 1.0) is to be > interpreted as a key for the a table controlled by the W3C. Yes. > But the W3C doesn't actually doesn't own -//W3C like it owns > www.w3.org, and anyone can make a document with the FPI -//W3C. True. Have you seen K.4.6 "Internet domain names in public identifiers" of the WebSGML TC? http://www.ornl.gov/sgml/wg8/document/1955.htm You could have something like this: +//IDN w3.org::www//DTD XHTML 1.0//EN//http:/TR/xhtml-basic/xhtml-basic10.dtd (which, amazingly enough, is best compared with a gopher string!) > So really URL's a better in this respect. No. They're exactly the same. The real problem is that, under the current rules, a URI can't be the minimum data following the PUBLIC keyword. Of course, at root, this is just legalistic mumbo-jumbo, and the SYSTEM keyword is the official *kludge* to get around this "problem". That is, there should never be a need for a PUBLIC *and* a SYSTEM identifier. All you need in a document is a name - its internal syntax is irrelevant (except for *verification* purposes). Internal syntax becomes important for an address, and all addresses should be in catalogs. Make a reasoned argument that catalogs are *never* necessary, and you have a case that an address is as good as a name. > So surprisingly, the URL is actually independent of machine name > (because of virtual machine names) and independent of protocol > (because of uniformity). Please explain this "uniformity" bit. What happens with ftp? Arjun
Received on Friday, 18 February 2000 14:15:45 UTC