- From: Misha Wolf <Misha.Wolf@reuters.com>
- Date: Thu, 19 Dec 2002 10:34:15 +0000
- To: Dan Connolly <connolly@w3.org>
- Cc: www-tag@w3.org
Dan, > On Wed, 2002-12-18 at 08:43, Misha Wolf wrote: > > The main question for me is on which strings do you carry > > out the strcmp: > > > > 1. The strings as they appear in (an) XML document(s) > > > > 2. The strings processed in accordance with: > > http://www.w3.org/TR/REC-xml#AVNormalize > > > > The first is wrong; the second is right. > > Well, provided the string contains only US-ASCII characters, > that's pretty much the end of the story. Consider: a) numeric character references (decimal and hexadecimal) b) entities All of these can be used for US-ASCII characters. Misha > But if it has other characters (i.e. if it's not actually > a URI), you have to reduce > it to the URI character set before doing the > strcmp(). > > And if what's in the XML document is a relative > URI reference, you need to combine it with the > base before doing the strcmp(). > > So yes, I think there's something worth saying. > > -- > Dan Connolly, W3C http://www.w3.org/People/Connolly/ -------------------------------------------------------------- -- Visit our Internet site at http://www.reuters.com Get closer to the financial markets with Reuters Messaging - for more information and to register, visit http://www.reuters.com/messaging Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Reuters Ltd.
Received on Thursday, 19 December 2002 05:34:44 UTC