Resolving DC PURLs

Hi Tom,

On Friday, 26 March, Ian Davis (CTO of Talis) reported that purl.org was rejecting some requests to Dublin Core PURLs [1].  I asked OCLC to increase the number of threads used by their PURL server and the maximum number of concurrent connections.  They complied this afternoon.

At the time of the failures, OCLC reported that purl.org was rejecting between 20 and 60 requests per second for DC terms [2].

It would seem that Linked Data clients are becoming more prevalent and that some of them are not particularly well behaved.  Perhaps we need to raise awareness of the importance of caching.
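
As a rough illustration of the kind of client-side caching I have in mind, here is a minimal sketch in Python.  The class name CachingFetcher, the one-day default lifetime and the Accept header are assumptions made for illustration only; they do not describe any particular client, nor anything purl.org requires.

```python
import time
import urllib.request
from typing import Callable, Dict, Tuple


def http_fetch(url: str) -> bytes:
    """Dereference a URL; urllib follows the PURL redirect automatically."""
    # The Accept header is an assumption; ask for whatever format you parse.
    req = urllib.request.Request(url, headers={"Accept": "application/rdf+xml"})
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.read()


class CachingFetcher:
    """Hypothetical in-memory cache so each term URL is fetched at most
    once per ttl_seconds instead of once per use."""

    def __init__(
        self,
        fetch: Callable[[str], bytes] = http_fetch,
        ttl_seconds: float = 86400.0,  # assumed lifetime: one day
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        # Maps URL -> (time the body was stored, body).
        self._entries: Dict[str, Tuple[float, bytes]] = {}

    def get(self, url: str) -> bytes:
        now = self._clock()
        entry = self._entries.get(url)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]  # still fresh: no request reaches the server
        body = self._fetch(url)
        self._entries[url] = (now, body)
        return body
```

A client would create one CachingFetcher and call get("http://purl.org/dc/terms/title") wherever it needs the term description; repeated calls within the lifetime are served from memory.  A better-behaved client would also honour the cache headers returned by the server, which this sketch ignores.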

This incident highlights both how critical DC terms have become to the Linked Data community and how fragile a single point of failure such as purl.org is.  The PURL Federation development recently announced by NCBO and Zepheira [3] may eventually remove the single point of failure, but dependence on the service is likely to grow with time.

This message is simply an advisory to the DC community about some practical issues arising from the use of DC terms by the Linked Data community; it requires no immediate action.  I would ask, though, that these issues be kept in mind as Dublin Core considers the management of its identifiers.

[1]  http://twitter.com/IanD
[2]  Personal correspondence from Tom Dehn at OCLC
[3]  http://zepheira.com/publications/news/#PURLFederationDevelopment


Regards,
Dave
--
David Wood, Ph.D.
Partner
Zepheira - The Art of Data
http://zepheira.com/team/dave/

Received on Monday, 29 March 2010 23:22:39 UTC