RE: Finding News Taxonomies [was: RE: Towards a TAG consideration of CURIEs]

From: Misha Wolf <Misha.Wolf@reuters.com>
Date: Sat, 07 Apr 2007 17:33:20 +0100
To: www-tag@w3.org, semantic-web@w3.org, public-xg-mmsem@w3.org, newsml-g2@yahoogroups.com
Message-id: <A29ADE959C70A1449470AA9A212F5D8004E9E1B9@LONSMSXM06.emea.ime.reuters.com>

John Cowan wrote:

> Misha Wolf scripsit:
> 
> > This:
> >    http://www.iptc.org/docs/newscodes.html#123456
> > is not legal, as "123456" is an illegal fragment identifier.
> 
> Not exactly.  We can decompose this into three claims, two false
> and one true.
> 
> 1) "123456" is an invalid fragment: false.  If you look at the
> syntax rules in RFC 3986, you see that every character in a 
> fragment can be a digit.
> 
> 2) "123456" can't be the value of an XML attribute of type ID: 
> false.  An XML document may contain attributes of type ID in one 
> of two ways: every attribute with the name "xml:id" is of type ID,
> and so is any attribute declared in the DTD (internal or external)
> to have type ID.  Such attributes may contain any value, and the 
> document is well-formed.
> 
> 3) "123456" can't be the value of an attribute of type ID in a
> *valid* XML document: true.  However, plenty of documents are not
> valid: in particular, any document without a DTD is not valid, and
> there is nothing wrong with having a DTD without expecting or 
> requiring validity.
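
The three claims quoted above can be checked with a short sketch. This is an illustration under stated assumptions, not a conformance test: the fragment pattern is transcribed from the RFC 3986 grammar, the Name pattern is an ASCII-only approximation of the XML 1.0 Name production (the real production also admits many non-ASCII characters), and the helper names is_rfc3986_fragment and is_ascii_xml_name are hypothetical. Python's ElementTree uses a non-validating parser, so it reports well-formedness only.

```python
import re
import xml.etree.ElementTree as ET

# RFC 3986: fragment = *( pchar / "/" / "?" )
# pchar = unreserved / pct-encoded / sub-delims / ":" / "@"
FRAGMENT_RE = re.compile(r"(?:[A-Za-z0-9\-._~!$&'()*+,;=:@/?]|%[0-9A-Fa-f]{2})*")

# ASCII-only approximation (an assumption) of the XML 1.0 Name production,
# which values of type ID must match in a *valid* document.
ASCII_NAME_RE = re.compile(r"[A-Za-z_:][A-Za-z0-9._:\-]*")


def is_rfc3986_fragment(text: str) -> bool:
    """Claim 1: does the text match the RFC 3986 fragment syntax?"""
    return FRAGMENT_RE.fullmatch(text) is not None


def is_ascii_xml_name(text: str) -> bool:
    """Claim 3: could the text be an ID value in a valid document (ASCII case)?"""
    return ASCII_NAME_RE.fullmatch(text) is not None


def is_well_formed(document: str) -> bool:
    """Claim 2: does a non-validating parser accept the document?"""
    try:
        ET.fromstring(document)
        return True
    except ET.ParseError:
        return False


# Two ways of giving an element an attribute of type ID, both with a
# purely numeric value.
XML_ID_DOC = '<doc xml:id="123456"/>'
DTD_ID_DOC = '<!DOCTYPE doc [<!ATTLIST doc id ID #IMPLIED>]><doc id="123456"/>'

print(is_rfc3986_fragment("123456"))   # syntactically fine as a fragment
print(is_well_formed(XML_ID_DOC))      # well-formed despite the numeric ID
print(is_well_formed(DTD_ID_DOC))      # likewise with a DTD-declared ID
print(is_ascii_xml_name("123456"))     # not a Name, so not valid as an ID
```

A value such as "c123456" passes all of the checks, which is why prefixing numeric codes with a letter is a common workaround.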

I'm out of my depth here.  At the W3C AC meeting in Edinburgh last
year, I understood Henry to be saying that something like:

   http://www.iptc.org/docs/newscodes.html#123456

is not legal, as "123456" is an illegal fragment identifier.  Could it
be that the resulting XML document is legal, but that the use in
(X)HTML is illegal?

Misha Wolf
News Standards Manager, Reuters, http://www.reuters.com/
Vice Chair, News Architecture WP, IPTC, http://www.iptc.org/

Received on Saturday, 7 April 2007 16:33:41 GMT