W3C home > Mailing lists > Public > semantic-web@w3.org > June 2011

Fwd: Re: Silk - Link Discovery Framework Version 2.4 release

From: Federico M. Facca <federico.facca@create-net.org>
Date: Wed, 01 Jun 2011 17:45:38 +0200
Message-ID: <4DE65EA2.3080205@create-net.org>
To: paoladimaio10@googlemail.com
CC: robertisele@googlemail.com, marta.nagy-rothengass@ec.europa.int, Constantijn.Van-Oranje-Nassau@ec.europa.eu, semantic-web@w3.org
Hash: SHA1

FYI it seems I replied only to the LOD list and the list does not like
too much my email address.

- -------- Original Message --------
Subject: Re: Silk - Link Discovery Framework Version 2.4 release
Date: Wed, 01 Jun 2011 17:35:53 +0200
From: Federico M. Facca <federico.facca@create-net.org>
Organisation: CREATE-NET
To: public-lod@w3.org

Dear Paola,
I share your concerns on public money spending in EU projects.
I am not part of LOD2 project neither fu-berlin, but looking at the page
content, code commits and wikis of this release, I assume that
what you scouted is a simple error. I am still writing 2010 in some
documents too :) as matter of fact humans hate that the time is passing by!

Of course this does not mean that LOD is spending money better or worst
than other projects or whatever.

Generally I have to say that I like the idea of silk and coming from
data mining research, having gone through web industry, moved to
semantic web research, and then back to web engineering:
I know how much this is needed and how much hard it can be to be achieved..

All the best,


2.4	Added the new Silk Workbench, a web application which guides the
user through the process of interlinking different data sources.
2.3	Improved loading perfomance: Multiple parallel SPARQL queries are
executed, while their results are merged on the fly.
Improved matching performance: New blocking method offers greatly
improved performance.
Improved overall performance: Matching tasks are now executed
concurrently to loading data instead of waiting for the complete data
set to be loaded.
2.2	Added Silk MapReduce	2010-10-06
2.1	Added Silk Server
Added a geographical distance metric by Konrad Höffner (MOLE subgroup of
Research Group AKSW, University of Leipzig)
Bugfixes	2010-09-15
2.0	Reimplementation of the Silk framework in Scala.
Improved scalability and performance.
Prematching replaced by a more transparent blocking.
Configuration is checked for consistency prior to link generation.
Support of the OAEI Alignment format.
(Anja and Robert)	2010-07-01
Added prematching of data items (Julius).
The Silk 0.2 language specification is still available and Silk 0.2
framework can be downloaded from GoogleCode.

0.1	Initial Release of the Phyton version of the Silk framework
(Julius and Chris)	2009-02-01

On 01/06/11 17.16, Paola Di Maio wrote:
> Robert
> thanks  lot for the update, I look forward to be trying it out
> I see from this page
> http://www4.wiwiss.fu-berlin.de/bizer/silk/
> that SILK V 2.4, announced on this list today was actually released
> last year: See the snippet below
> 2010-06-01: Version 2.4 released including the new Silk Workbench, a
> web application which guides the user through the process of
> interlinking different data sources.
> I also seem to understand from the project page that much of LOD2
> software are tools developed in previous years (ie, nothing new!)
> Am I reading something wrong?
> In the past decade or so, millions of euros of tax payers money have
> been paid for projects for which the codebase had already been
> developed, either by funded projects from prior calls( ie, for which
> the tax payer had already paid ) or by other companies.
> In essence, as it has been already pointed out, the public has been
> paying for the same semantic web tools to be rebranded over and over,
> and each time it has costed lots of public money, and each time it has
> not delivered the semantic web functionality the public is waiting for
> (ie, a useable web based application layer)
> Since LOD2 has become a funded EU project in September 2010, I would
> be grateful if you could explain what part of the tool/functionality
> has been developed after September 2010, and for what part of this
> development is the public funding being used for
> Thanks a lot in advance
> On Wed, Jun 1, 2011 at 3:35 PM, Robert Isele <robertisele@googlemail.com> wrote:
>> Hi all,
>> we are happy to announce version 2.4 of the Silk - Link Discovery
>> Framework for the Web of Data.
>> The central idea of the Web of Data is to interlink data items using
>> RDF links. However, in practice most data sources are not sufficiently
>> interlinked with related data sources. The Silk Link Discovery
>> Framework addresses this problem by providing tools to generate links
>> between data items based on user-provided link specifications. It can
>> be used by data publishers to generate links between datasets as well
>> as by Linked Data consumers to augment Web data with additional RDF
>> links.
>> Link specifications can either be written manually or developed using
>> the new Silk Workbench. The Silk Workbench, is a web application which
>> guides the user through the process of interlinking different data
>> sources. It’s being shipped with the 2.4 version of Silk.
>> The Silk Workbench offers the following features:
>> - It enables the user to manage different sets of data sources and
>> linking tasks.
>> - It offers a graphical editor which enables the user to easily create
>> and edit link specifications.
>> - As finding a good linking heuristics is usually an iterative
>> process, the Silk Workbench makes it possible for the user to quickly
>> evaluate the links which are generated by the current link
>> specification.
>> - It allows the user to create and edit a set of reference links used
>> to evaluate the current link specification.
>> The Silk Link Discovery Framework includes three applications to
>> execute the link specifications which address different use cases:
>> 1. Silk Single Machine is used to generate RDF links on a single
>> machine. The datasets that should be interlinked can either reside on
>> the same machine or on remote machines which are accessed via the
>> SPARQL protocol. Silk Single Machine provides multithreading and
>> caching. In addition, the performance can be further enhanced using an
>> optional blocking feature.
>> 2. Silk Server can be used as an identity resolution component within
>> applications that consume Linked Data from the Web. Silk Server
>> provides an HTTP API for matching instances from an incoming stream of
>> RDF data while keeping track of known entities. It can be used for
>> instance together with a Linked Data crawler to populate a local
>> duplicate-free cache with data from the Web.
>> 3. Silk MapReduce is used to generate RDF links between datasets using
>> a cluster of multiple machines. Silk MapReduce is based on Hadoop and
>> can for instance be run on Amazon Elastic MapReduce. Silk MapReduce
>> enables Silk to scale out to very big datasets by distributing the
>> link generation to multiple machines.
>> More information about the Silk framework, the Silk Link Specification
>> Language, as well as several examples that demonstrate how Silk is
>> used to set links between different data sources in the LOD cloud is
>> found at:
>> http://www4.wiwiss.fu-berlin.de/bizer/silk/
>> The Silk framework is provided under the terms of the Apache License,
>> Version 2.0 and can be downloaded from
>> http://www4.wiwiss.fu-berlin.de/bizer/silk/releases/
>> The development of Silk was supported by Vulcan Inc. as part of its
>> Project Halo (www.projecthalo.com) and by the EU FP7 project LOD2 -
>> Creating Knowledge out of Interlinked Data (http://lod2.eu/, Ref. No.
>> 257943).
>> Thanks to  Christian Becker, Michal Murawicki and Andrea Matteini for
>> contributing to the Silk Workbench.
>> Happy linking,
>> Robert Isele, Anja Jentzsch and Chris Bizer

Version: GnuPG/MacGPG2 v2.0.16 (Darwin)

Received on Friday, 3 June 2011 11:30:32 UTC

This archive was generated by hypermail 2.4.0 : Tuesday, 5 July 2022 08:45:25 UTC