W3C home > Mailing lists > Public > public-lod@w3.org > April 2010

ANN: DBpedia 3.5.1 released

From: Robert Isele <robertisele@googlemail.com>
Date: Wed, 28 Apr 2010 18:38:22 +0200
Message-ID: <l2g4ac89111004280938if130b05fz579c8852377df416@mail.gmail.com>
To: dbpedia-discussion@lists.sourceforge.net
Cc: dbpedia-announcements@lists.sourceforge.net, public-lod@w3.org, SW-forum <semantic-web@w3.org>, semantic-web@yahoogroups.com
Hi all,

we are happy to announce the release of DBpedia 3.5.1.

This is primarily a bugfix release, which is based on Wikipedia dumps
dating from March 2010. Thanks to the great community feedback about
the previous DBpedia release, we were able to resolve the reported
issues as well as to improve template to ontology mappings.

The new release provides the following improvements and changes
compared to the DBpedia 3.5 release:

1. Some abstracts contained unwanted WikiText markup. The detection of
infoboxes and tables has been improved, so that even most pages with
syntax errors have clean abstracts now.
2. In 3.5 there has been an issue detecting interlanguage links, which
led to some non-english statements having the wrong subject. This has
been fixed.
3. Image references to dummy images (e.g.
http://en.wikipedia.org/wiki/Image:Replace_this_image.svg) have been
removed.
4. DBpedia 3.5.1 uses a stricter IRI validation now. Care has been
taken to only discard URIs from Wikipedia, which are clearly invalid.
5. Recognition of disambiguation pages has been improved, increasing
the size from 247,000 to 769,000 triples.
6. More geographic coordinates are extracted now, increasing its
number from 1,200,000 to 1,500,000 in the english version.
7. All Freebase links have been regenerated from the most recent freebase dump.

You can download the new DBpedia dataset from
http://wiki.dbpedia.org/Downloads351. As usual, the data set is also
available as Linked Data and via the DBpedia SPARQL endpoint.

Lots of thanks to:

* Jens Lehmann and Sören Auer (both Universität Leipzig) for providing
the knowledge base via the DBpedia download server at Universität
Leipzig.

* Kingsley Idehen and Mitko Iliev (both OpenLink Software) for loading
the knowledge base into the Virtuoso instance that serves the Linked
Data view and SPARQL endpoint.

The whole DBpedia team is very thankful to three companies which
enabled us to do all this by supporting and sponsoring the DBpedia
project:

* Neofonie GmbH (http://www.neofonie.de), a Berlin-based company
offering leading technologies in the area of Web search, social media
and mobile applications.

* Vulcan Inc. as part of its Project Halo
(http://www.projecthalo.com). Vulcan Inc. creates and advances a
variety of world-class endeavors and high impact initiatives that
change and improve the way we live, learn, do business
(http://www.vulcan.com).

* OpenLink Software (http://www.openlinksw.com). OpenLink Software
develops the Virtuoso Universal Server, an innovative enterprise grade
server that cost-effectively delivers an unrivaled platform for Data
Access, Integration and Management.

More information about DBpedia is found at http://dbpedia.org/About

Have fun with the new DBpedia knowledge base!

Cheers,

Robert Isele and Anja Jentzsch
Received on Thursday, 29 April 2010 08:43:50 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:26 UTC