W3C home > Mailing lists > Public > uri@w3.org > May 2004

RE: draft-hoffman-rfc1738bis-02.txt

From: McDonald, Ira <imcdonald@sharplabs.com>
Date: Fri, 7 May 2004 08:00:36 -0700
Message-ID: <CFEE79A465B35C4385389BA5866BEDF00C76B2@mailsrvnt02.enet.sharplabs.com>
To: "'Larry Masinter'" <LMM@acm.org>, uri@w3.org

Hi Larry,

A separate spec on 'file:' is an excellent idea.

Cheers,
- Ira

Ira McDonald (Musician / Software Architect)
Blue Roof Music / High North Inc
PO Box 221  Grand Marais, MI  49839
phone: +1-906-494-2434
email: imcdonald@sharplabs.com

-----Original Message-----
From: Larry Masinter [mailto:LMM@acm.org]
Sent: Thursday, May 06, 2004 1:03 PM
To: uri@w3.org
Subject: Re: draft-hoffman-rfc1738bis-02.txt



Re http://lists.w3.org/Archives/Public/uri/2004Apr/0055.html

> However, it also implies that character encoding is choosable by 
> users, and I think that is not the case in many systems. That is, 
> many systems will only allow an ISO 8859-x encoding for file names. 
> What you are asking is that the names in those cases must be 
> re-encoded from the "native" encoding to the standard encoding.

> That will (a) induce errors, particularly when people don't bother to 
> re-encode and (b) increase interoperability. How do people feel about 
> this balance?

I'm not convinced that it will induce errors, especially if
decoders try looking up based on local encoding if the UTF-8
decoding doesn't work.

So I would suggest

 filename -> file URL  
   SHOULD reencode from the local encoding to UTF-8

and

  file URL -> filename
     SHOULD reencode from UTF-8 -> local encoding,
    with the possibility that alternate reencoding
    (or non-encoding) might also be tried.

This is also more consistent with IRIs.

The translation between file URIs and file paths requires some
amount of reencoding anyway, on most systems, just to change the
hierarchy delimiter: from "\" (Windows UNC) or ":" (Mac OS 9) to "/".

Other notes:

I think file://usr/local/bin/  should be file:///usr/local/bin.

For 'security considerations':

http://cert.uni-stuttgart.de/archive/bugtraq/2001/07/msg00375.html

Since there are more comments on 'file' than on the other
schemes in this document, perhaps we could pull out the
"file" URI out into a separate document?

I'm willing to take a run at this, if there's no objection.

Larry
Received on Friday, 7 May 2004 11:01:24 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 5 February 2014 07:13:51 UTC