Re: Wikipedia and Geonames. was: AW: ANN: RDF Book Mashup - Integrating Web 2.0 data sources like Amazon and Google into the Semantic Web

> Some further ideas along these lines. What about scraping information 
> about geograpic places like countries and cities from Wikipedia and 
> linking the data to geonames (http://www.geonames.org/ontology/)?
We are already working on linking wikipedia articles with geonames 
places. See also this thread in October : 
http://lists.w3.org/Archives/Public/semantic-web/2006Oct/0148.html

Now that you are asking for it, I have released today a first version, 
which includes the following wikipedia information about Embrun :

<wikipediaArticle>http://fr.wikipedia.org/wiki/Embrun_%28Hautes-Alpes%29</wikipediaArticle>
<wikipediaArticle>http://pl.wikipedia.org/wiki/Embrun</wikipediaArticle>
<wikipediaArticle>http://de.wikipedia.org/wiki/Embrun</wikipediaArticle>
<wikipediaArticle>http://en.wikipedia.org/wiki/Embrun%2C_Hautes-Alpes</wikipediaArticle>
<wikipediaArticle>http://it.wikipedia.org/wiki/Embrun</wikipediaArticle>
<wikipediaArticle>http://nl.wikipedia.org/wiki/Embrun</wikipediaArticle>

Around 100,000 geonames place names now have wikipedia links.

> I once read about some pretty sophisticated screen-scraping frameworks
As far as I know crawling and screen scraping wikipedia is not 
considered fair use. Wikimedia software is rather resource intensive and 
the preferred way is to use the xml download files : 
http://download.wikipedia.org/

Cheers

Marc

Received on Sunday, 3 December 2006 22:05:26 UTC