W3C home > Mailing lists > Public > www-tag@w3.org > September 2013

Re: Link rot in Supreme Court decisions

From: Herbert van de Sompel <hvdsomp@gmail.com>
Date: Wed, 25 Sep 2013 08:53:04 -0600
Message-ID: <CAOywMHfgs_VXK2jgiYa0GsqJorOmXfckQjiW92=G6dF7nCv7AQ@mail.gmail.com>
To: Larry Masinter <masinter@adobe.com>
Cc: Mark Nottingham <mnot@mnot.net>, "www-tag@w3.org WG" <www-tag@w3.org>
On Wednesday, September 25, 2013, Larry Masinter wrote:

> to archive documents for a long time, you need more than just link
> stability. You need document formats designed for archive, and you need a
> storage system that can guarantee integrity for the lifetime of the
> information archived.
>
> My take on this was http://larry.masinter.net/0603-archiving.pdf -- your
> answers might be different but the questions remain.


There's a large international community working on those problems under the
umbrella "digital preservation". See e.g.
http://en.wikipedia.org/wiki/Digital_preservation

The document I shared does not claim it solves long term archiving of
documents. It focuses on a single problem that people in certain pockets of
the web are increasingly concerned about: link stability.

Cheers

Herbert



>
> Larry
>
>
> > -----Original Message-----
> > From: Herbert Van de Sompel [mailto:hvdsomp@gmail.com <javascript:;>]
> > Sent: Tuesday, September 24, 2013 2:58 PM
> > To: Karl Dubost
> > Cc: Mark Nottingham; www-tag@w3.org <javascript:;> WG
> > Subject: Re: Link rot in Supreme Court decisions
> >
> > Karl
> >
> > I couldn't agree more. If you find a moment, pls read the document
> >
> > http://mementoweb.org/missing-link/
> >
> > Herbert
> >
> > Sent from my iPhone
> >
> > On Sep 24, 2013, at 16:45, Karl Dubost <karl@la-grange.net<javascript:;>>
> wrote:
> >
> > > Mark Nottingham [2013-09-23T21:50]:
> > >> I'm sure some here will enjoy / be horrified by this.
> > >> http://www.nytimes.com/2013/09/24/us/politics/in-supreme-court-
> > opinions-clicks-that-lead-
> > nowhere.html?partner=rss&emc=rss&utm_source=twitterfeed&utm_medium=t
> > witter&_r=0
> > >
> > > horrified by the tracking link ;) :p
> > >
> http://www.nytimes.com/2013/09/24/us/politics/in-supreme-court-opinions-
> > clicks-that-lead-nowhere.html
> > >
> > > That said:
> > >
> > > * Identifier
> > > * Actual content
> > > * Duplication
> > >  - Location
> > >  - Rights
> > >
> > > Many things which are published on papers are extremely resistant to
> time,
> > because of the third property, aka duplicated in many places apart from
> each
> > other. In addition to that, libraries have a special status with regards
> to the law
> > on keeping and circulate copies of a work.
> > >
> > > On the Web, there are cache systems but not really with an archiving
> policy.
> > And we have the issue of most of the time what is identified is
> what/where is
> > stored. Practical for fresh information, catastrophic for the fabric of
> time. In an
> > aesthetics of the Web, rust is spreading quite quickly.
> > >
> > > In addition to that, the identifier is dependent on the location owner
> (domain
> > name).
> > >
> > > --
> > > Karl Dubost
> > > http://www.la-grange.net/karl/
> > >
> > >
>
>

-- 
Herbert Van de Sompel
Digital Library Research & Prototyping
Los Alamos National Laboratory, Research Library
http://public.lanl.gov/herbertv/

==
Received on Wednesday, 25 September 2013 14:53:35 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 15:33:21 UTC