- From: Antoine Zimmermann <antoine.zimmermann@insa-lyon.fr>
- Date: Wed, 19 Jan 2011 17:03:28 +0100
- To: Chris Bizer <chris@bizer.de>
- CC: dbpedia-announcements@lists.sourceforge.net, dbpedia-discussion@lists.sourceforge.net, 'Semantic Web' <semantic-web@w3.org>, 'public-lod' <public-lod@w3.org>
Haha, I went to that page but immediately skipped the header and went downward to the datasets links... Thanks a lot. Regards, AZ. Le 19/01/2011 16:58, Chris Bizer a écrit : > Hi Antoine, > >> I was wondering: do you keep older versions of the DBpedia datasets? > > we do and they are all reachable from the DBpedia download page at > > http://wiki.dbpedia.org/Downloads36 > > Just click on the "Older Versions" links. > > Cheers, > > Chris > > >> -----Ursprüngliche Nachricht----- >> Von: public-lod-request@w3.org [mailto:public-lod-request@w3.org] Im >> Auftrag von Antoine Zimmermann >> Gesendet: Mittwoch, 19. Januar 2011 16:49 >> An: Chris Bizer >> Cc: dbpedia-announcements@lists.sourceforge.net; dbpedia- >> discussion@lists.sourceforge.net; 'Semantic Web'; 'public-lod' >> Betreff: Re: ANN: DBpedia 3.6 released >> >> Dear Chris and the DBpedia crew, >> >> >> As always, a new version of DBpedia is very good news for the Semantic >> Web and Linked Data. >> >> I was wondering: do you keep older versions of the DBpedia datasets? >> If yes, would you allow people to download older versions for research >> purposes? >> >> This would be very useful in order to study the dynamics of RDF data, or >> the dynamics of DBpedia itself. There are already papers on the dynamics >> of Wikipedia but I am not aware of corresponding work for DBPedia. >> >> >> Regards, >> AZ. >> >> Le 17/01/2011 14:10, Chris Bizer a écrit : >> > Hi all, >> > >> > we are happy to announce the release of DBpedia 3.6. The new release is >> > based on Wikipedia dumps dating from October/November 2010. >> > >> > The new DBpedia dataset describes more than 3.5 million things, of > which >> > 1.67 million are classified in a consistent ontology, including 364,000 >> > persons, 462,000 places, 99,000 music albums, 54,000 films, 16,500 > video >> > games, 148,000 organizations, 148,000 species and 5,200 diseases. >> > >> > The DBpedia dataset features labels and abstracts for 3.5 million >> things in >> > up to 97 different languages; 1,850,000 links to images and 5,900,000 >> links >> > to external web pages; 6,500,000 external links into other RDF >> datasets, and >> > 632,000 Wikipedia categories. >> > >> > The dataset consists of 672 million pieces of information (RDF >> triples) out >> > of which 286 million were extracted from the English edition of > Wikipedia >> > and 386 million were extracted from other language editions and links > to >> > external datasets. >> > >> > Along with the release of the new datasets, we are happy to announce > the >> > initial release of the DBpedia MappingTool >> > (http://mappings.dbpedia.org/index.php/MappingTool): a graphical user >> > interface to support the community in creating and editing mappings >> as well >> > as the ontology. >> > >> > The new release provides the following improvements and changes >> compared to >> > the DBpedia 3.5.1 release: >> > >> > 1. Improved DBpedia Ontology as well as improved Infobox mappings >> using >> > http://mappings.dbpedia.org/. >> > >> > Furthermore, there are now also mappings in languages other than >> English. >> > These improvements are largely due to collective work by the community. >> > There are 13.8 million RDF statements based on mappings (11.1 million > in >> > version 3.5.1). All this data is in the /ontology/ namespace. Note >> that this >> > data is of much higher quality than the Raw Infobox data in the >> /property/ >> > namespace. >> > >> > Statistics of the mappings wiki on the date of release 3.6: >> > >> > + Mappings: >> > + English: 315 Infobox mappings (covers 1124 templates including >> > redirects) >> > + Greek: 137 Infobox mappings (covers 192 templates including >> > redirects) >> > + Hungarian: 111 Infobox mappings (covers 151 templates including >> > redirects) >> > + Croatian: 36 Infobox mappings (covers 67 templates including >> > redirects) >> > + German: 9 Infobox mappings >> > + Slovenian: 4 Infobox mappings >> > + Ontology: >> > + 272 classes >> > + Properties: >> > + 629 object properties >> > + 706 datatype properties (they are all in the /datatype/ >> namespace) >> > >> > 2. Some commonly used property names changed. >> > >> > + Please see http://dbpedia.org/ChangeLog and >> > http://dbpedia.org/Datasets/Properties to know which relations >> changed and >> > update your applications accordingly! >> > >> > 3. New Datatypes for increased quality in mapping-based properties >> > >> > + xsd:positiveInteger, xsd:nonNegativeInteger, xsd:nonPositiveInteger, >> > xsd:negativeInteger >> > >> > 4. Improved parsing coverage. >> > >> > + Parsing of lists of elements in Infobox property values that >> improves the >> > completeness of extracted facts. >> > + Method to deal with missing repeated links in Infoboxes that do > appear >> > somewhere else on the page. >> > + Flag templates are parsed. >> > + Various improvements on internationalization. >> > >> > 5. Improved recognition of >> > >> > + Wikipedia namespace identifiers. >> > + Wikipedia language codes. >> > + Category hierarchies. >> > >> > 6. Disambiguation links for acronyms (all upper-case title) are now >> > extracted (for example, Kilobyte and Knowledge_base for "KB"): >> > >> > + Wikilinks consisting of multiple words: If the starting letters of > the >> > words appear in correct order (with possible gaps) and cover all > acronym >> > letters. >> > + Wikilinks consisting of a single word: If the case-insensitive > longest >> > common subsequence with the acronym is equal to the acronym. >> > >> > 7. Encoding (bugfixes): >> > >> > + The new datasets support the complete range of Unicode code points >> (up to >> > 0x10ffff). 16-bit code points start with '\u', code points larger than >> > 16-bits start with '\U'. >> > + Commas and ampersands do not get encoded anymore in URIs. Please >> see >> > http://dbpedia.org/URIencoding for an explanation regarding the >> DBpedia URI >> > encoding scheme. >> > >> > 8. Extended Datasets: >> > >> > + Thanks to Johannes Hoffart (Max-Planck-Institut für Informatik) for >> > contributing links to YAGO2. >> > + Freebase links have been updated. They now refer to mids >> > (http://wiki.freebase.com/wiki/Machine_ID) because guids have been >> > deprecated. >> > >> > You can download the new DBpedia dataset from >> http://dbpedia.org/Downloads36 >> > >> > As usual, the dataset is also available as Linked Data and via the >> DBpedia >> > SPARQL endpoint at http://dbpedia.org/sparql >> > >> > Lots of thanks to: >> > >> > + All editors that contributed to the DBpedia ontology mappings via the >> > Mappings Wiki. >> > + Max Jakob (Freie Universität Berlin, Germany) for improving the > DBpedia >> > extraction framework and for extracting the new datasets. >> > + Robert Isele and Anja Jentzsch (both Freie Universität Berlin, > Germany) >> > for helping Max with their expertise on the extraction framework. >> > + Paul Kreis (Freie Universität Berlin, Germany) for analyzing the >> DBpedia >> > data of the previous release and suggesting ways to increase quality > and >> > quantity. Some results of his work were implemented in this release. >> > + Dimitris Kontokostas (Aristotle University of Thessaloniki, >> Greece), Jimmy >> > O'Regan (Eolaistriu Technologies, Ireland), José Paulo Leal >> (University of >> > Porto, Portugal) for providing patches to improve the extraction >> framework. >> > + Jens Lehmann and Sören Auer (both Universität Leipzig, Germany) for >> > providing the new dataset via the DBpedia download server at > Universität >> > Leipzig. >> > + Kingsley Idehen and Mitko Iliev (both OpenLink Software) for >> loading the >> > dataset into the Virtuoso instance that serves the Linked Data view and >> > SPARQL endpoint. OpenLink Software (http://www.openlinksw.com/) >> altogether >> > for providing the server infrastructure for DBpedia. >> > >> > The work on the new release was financially supported by: >> > >> > + Neofonie GmbH, a Berlin-based company offering leading technologies >> in the >> > area of Web search, social media and mobile applications >> > (http://www.neofonie.de/). >> > + The European Commission through the project LOD2 - Creating >> Knowledge out >> > of Linked Data (http://lod2.eu/). >> > + Vulcan Inc. as part of its Project Halo > (http://www.projecthalo.com/). >> > Vulcan Inc. creates and advances a variety of world-class endeavors >> and high >> > impact initiatives that change and improve the way we live, learn, do >> > business (http://www.vulcan.com/). >> > >> > More information about DBpedia is found at http://dbpedia.org/About >> > >> > Have fun with the new dataset! >> > >> > The whole DBpedia team also congratulates Wikipedia to its 10th > Birthday >> > which was this weekend! >> > >> > Cheers, >> > >> > Chris Bizer >> > >> > >> > -- >> > Prof. Dr. Christian Bizer >> > Web-based Systems Group >> > Freie Universität Berlin >> > +49 30 838 55509 >> > http://www.bizer.de >> > chris@bizer.de >> > >> > >> > >> >> >> -- >> Antoine Zimmermann >> Researcher at: >> Laboratoire d'InfoRmatique en Image et Systèmes d'information >> Database Group >> 7 Avenue Jean Capelle >> 69621 Villeurbanne Cedex >> France >> Lecturer at: >> Institut National des Sciences Appliquées de Lyon >> 20 Avenue Albert Einstein >> 69621 Villeurbanne Cedex >> France >> antoine.zimmermann@insa-lyon.fr >> http://zimmer.aprilfoolsreview.com/ > -- Antoine Zimmermann Researcher at: Laboratoire d'InfoRmatique en Image et Systèmes d'information Database Group 7 Avenue Jean Capelle 69621 Villeurbanne Cedex France Lecturer at: Institut National des Sciences Appliquées de Lyon 20 Avenue Albert Einstein 69621 Villeurbanne Cedex France antoine.zimmermann@insa-lyon.fr http://zimmer.aprilfoolsreview.com/
Received on Wednesday, 19 January 2011 16:04:03 UTC