- From: Andreas Thalhammer <andreas.thalhammer@sti2.at>
- Date: Sun, 03 Jun 2012 16:05:13 +0200
- To: semantic-web@w3.org
Dear all,

I would like to present my findings from analysing the patterns of Google's Knowledge Graph.

Each summary has a unique identifier, which is used when linking to other entities. For example, H4sIAAAAAAAAAONgVuLQz9U3MKs0LgIAXXSnTQwAAAA stands for the summary of Steve Jobs. The URI to get the summary is:

https://www.google.com/search?hl=en&sa=X&q=steve+jobs&stick=H4sIAAAAAAAAAONgVuLQz9U3MKs0LgIAXXSnTQwAAAA

I found out that this key is created with two tools, namely gzip and base64: the content is compressed and then encoded. We can therefore reverse the key to find out what the original content was (hoping to find a link to Freebase). The way to go is the following:

1. Store the identifier above (H4sI...) in a file, e.g. id.b64
2. console:$ base64 -d id.b64 > id.gz
3. console:$ gunzip id.gz
4. console:$ cat id
   --> /m/06y3r

Now, add the Freebase namespace to that:

http://rdf.freebase.com/ns/m/06y3r

This redirects to:

http://www.freebase.com/view/en/steve_jobs

Have a nice Sunday!
Andreas

-- 
Andreas Thalhammer
PhD Student
Semantic Technology Institute
University of Innsbruck
http://www.sti2.at/
phone: +43 (0) 512507 6454
email: andreas.thalhammer@sti2.at
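
The decoding steps in the message above can also be sketched in Python. This is only an illustration under a few assumptions that the message does not state: the helper name decode_stick is hypothetical, the key is treated as URL-safe base64 with its trailing '=' padding stripped, and the identifier is pulled out of the decompressed payload with a regular expression because the payload appears to carry a few non-text bytes ahead of the /m/... string (a terminal would not display them).

```python
import base64
import gzip
import re

# Namespace taken from the message; everything else here is illustrative.
FREEBASE_NS = "http://rdf.freebase.com/ns"


def decode_stick(key: str) -> str:
    """Return the Freebase identifier embedded in a `stick` key (hypothetical helper)."""
    # The key is given without '=' padding, so restore it before decoding.
    padded = key + "=" * (-len(key) % 4)
    # Assumption: URL-safe alphabet. For keys without '-' or '_' this is
    # identical to standard base64.
    compressed = base64.urlsafe_b64decode(padded)
    # Equivalent of the gunzip step.
    payload = gzip.decompress(compressed)
    # Assumption: the identifier is the first "/m/..." run in the payload;
    # any leading non-text bytes are skipped.
    match = re.search(rb"/m/[0-9a-z_]+", payload)
    if match is None:
        raise ValueError("no /m/... identifier found in the payload")
    return match.group().decode("ascii")


key = "H4sIAAAAAAAAAONgVuLQz9U3MKs0LgIAXXSnTQwAAAA"
mid = decode_stick(key)
print(mid)                # the Freebase identifier
print(FREEBASE_NS + mid)  # the identifier with the namespace prepended
```

Searching for the identifier, rather than decoding the whole payload as text, keeps the sketch tolerant of whatever framing surrounds the string.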
Received on Sunday, 3 June 2012 14:05:46 UTC