W3C home > Mailing lists > Public > www-tag@w3.org > December 2002

RE: uri-comp draft necessary?

From: Misha Wolf <Misha.Wolf@reuters.com>
Date: Thu, 19 Dec 2002 10:34:15 +0000
Message-ID: <T5f42c77c5bc407b70679c@reuters.com>
To: Dan Connolly <connolly@w3.org>
Cc: www-tag@w3.org

Dan,

> On Wed, 2002-12-18 at 08:43, Misha Wolf wrote:
> > The main question for me is on which strings do you carry 
> > out the strcmp:
> > 
> > 1.  The strings as they appear in (an) XML document(s)
> > 
> > 2.  The strings processed in accordance with:
> >     http://www.w3.org/TR/REC-xml#AVNormalize
> > 
> > The first is wrong; the second is right.
> 
> Well, provided the string contains only US-ASCII characters,
> that's pretty much the end of the story.

Consider:
a)  numeric character references (decimal and hexadecimal)
b)  entities

All of these can be used for US-ASCII characters.

Misha

> But if it has other characters (i.e. if it's not actually
> a URI), you have to reduce
> it to the URI character set before doing the
> strcmp().
> 
> And if what's in the XML document is a relative
> URI reference, you need to combine it with the
> base before doing the strcmp().
> 
> So yes, I think there's something worth saying.
> 
> -- 
> Dan Connolly, W3C http://www.w3.org/People/Connolly/



-------------------------------------------------------------- --
        Visit our Internet site at http://www.reuters.com

Get closer to the financial markets with Reuters Messaging - for more
information and to register, visit http://www.reuters.com/messaging

Any views expressed in this message are those of  the  individual
sender,  except  where  the sender specifically states them to be
the views of Reuters Ltd.
Received on Thursday, 19 December 2002 05:34:44 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 26 April 2012 12:47:14 GMT