- From: Misha Wolf <Misha.Wolf@reuters.com>
- Date: Thu, 19 Dec 2002 10:34:15 +0000
- To: Dan Connolly <connolly@w3.org>
- Cc: www-tag@w3.org
Dan,
> On Wed, 2002-12-18 at 08:43, Misha Wolf wrote:
> > The main question for me is on which strings do you carry
> > out the strcmp:
> >
> > 1. The strings as they appear in (an) XML document(s)
> >
> > 2. The strings processed in accordance with:
> > http://www.w3.org/TR/REC-xml#AVNormalize
> >
> > The first is wrong; the second is right.
>
> Well, provided the string contains only US-ASCII characters,
> that's pretty much the end of the story.
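Consider:
a) numeric character references (decimal and hexadecimal)
b) entity references
All of these can be used to represent US-ASCII characters, so even a
pure US-ASCII string can be written in several ways that differ under
strcmp until the references are expanded.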
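A minimal sketch of the point, assuming Python's standard
xml.etree.ElementTree parser stands in for an XML processor. The element
name, attribute name, and example.org URI are made up for illustration.
The three documents spell the same attribute value with a predefined
entity, a decimal character reference, and a hexadecimal character
reference:

```python
import xml.etree.ElementTree as ET

# Hypothetical documents: the same value written three different ways.
docs = [
    '<a href="http://example.org/x?p=1&amp;q=2"/>',   # predefined entity
    '<a href="http://example.org/x?p=1&#38;q=2"/>',   # decimal reference
    '<a href="http://example.org/x?p=1&#x26;q=2"/>',  # hexadecimal reference
]

# Option 1: the strings exactly as they appear in the documents.
raw = [d.split('"')[1] for d in docs]

# Option 2: the attribute values as reported by an XML parser,
# after references have been expanded.
parsed = [ET.fromstring(d).get("href") for d in docs]

print(len(set(raw)))     # number of distinct literal spellings
print(len(set(parsed)))  # number of distinct values after parsing
```

Comparing the literal strings treats the three spellings as different
values; comparing the parsed values treats them as one.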
Misha
> But if it has other characters (i.e. if it's not actually
> a URI), you have to reduce
> it to the URI character set before doing the
> strcmp().
>
> And if what's in the XML document is a relative
> URI reference, you need to combine it with the
> base before doing the strcmp().
>
> So yes, I think there's something worth saying.
>
> --
> Dan Connolly, W3C http://www.w3.org/People/Connolly/
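The two further steps in the quoted text (combine with the base, then
reduce to the URI character set) could be sketched as follows. This is
an illustration only: the helper name comparable() is hypothetical, and
escaping non-ASCII characters as percent-encoded UTF-8 is an assumption,
not something stated in this thread.

```python
from urllib.parse import quote, urljoin

# Characters left untouched: URI reserved characters plus "%", so that
# escapes already present are not encoded a second time.
SAFE = ":/?#[]@!$&'()*+,;=%"

def comparable(ref, base):
    """Hypothetical helper: resolve ref against base, then percent-encode
    anything outside the URI character set (assumed UTF-8)."""
    absolute = urljoin(base, ref)
    return quote(absolute, safe=SAFE)

base = "http://example.org/dir/page"
left = comparable("caf\u00e9", base)                         # relative, non-ASCII
right = comparable("http://example.org/dir/caf%C3%A9", base)  # absolute, escaped
print(left == right)
```

Only after both steps does a plain string comparison of the two
references become meaningful.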
Received on Thursday, 19 December 2002 05:34:44 UTC