W3C home > Mailing lists > Public > public-lod@w3.org > June 2012

Re: Knowledge Graph links to Freebase

From: Hugh Glaser <hg@ecs.soton.ac.uk>
Date: Sun, 3 Jun 2012 20:25:22 +0000
To: Andreas Thalhammer <andreas.thalhammer@sti2.at>
CC: Semantic Web <semantic-web@w3.org>, public-lod community <public-lod@w3.org>
Message-ID: <EMEW3|120a13be3eaeff53fe18f7067d25f0f8o52LPP02hg|ecs.soton.ac.uk|AEDC5435-6D51-4298-91A7-44A84233929D@ecs.soton.ac.uk>
Great detective work!
Actually, (apart from systems that don't have base64), I think you will find there are some characters at the start of the id file that include backspace - can't imagine why they would do that!

Anyway, as it is Sunday of a holiday weekend:
http://sameas.org/ggraph/?key=H4sIAAAAAAAAAONgVuLQz9U3MKs0LgIAXXSnTQwAAAA
(It's not pretty, as it probably isn't useful - if someone wants it to do something else I can change it).

Best
Hugh

On 3 Jun 2012, at 15:05, Andreas Thalhammer wrote:

> Dear all,
> 
> I want to take the opportunity to present my findings after analysing the patterns of Google's Knowledge Graph.
> 
> Each summary has a unique identifier. This identifier is used when linking to other entities, e.g.
> 
> H4sIAAAAAAAAAONgVuLQz9U3MKs0LgIAXXSnTQwAAAA
> 
> stands for the summary of Steve Jobs.
> 
> The URI to get the summary is
> https://www.google.com/search?hl=en&sa=X&q=steve+jobs&stick=H4sIAAAAAAAAAONgVuLQz9U3MKs0LgIAXXSnTQwAAAA
> 
> I found out that this key is created with 2 tools, namely base64 and gzip.
> 
> We can use this key to find out what the original content was (hoping to find a link to Freebase).
> 
> The way to go is the following:
> 
> 1. Store the identifier above (H4sl..) in a file, e.g. id.b64
> 2. console:$ base64 -d id.b64 > id.gz
> 3. console:$ gunzip id.gz
> 4. console:$ cat id
> 
> --> /m/06y3r
> 
> Now, add the freebase namespace to that:
> 
> http://rdf.freebase.com/ns/m/06y3r
> 
> This redirects to:
> 
> http://www.freebase.com/view/en/steve_jobs
> 
> Have a nice Sunday!
> 
> Andreas
> 
> -- 
> Andreas Thalhammer
> PhD Student
> Semantic Technology Institute
> University of Innsbruck
> http://www.sti2.at/
> 
> phone: +43 (0) 512507 6454
> email: andreas.thalhammer@sti2.at
> 
> 

-- 
Hugh Glaser,  
             Web and Internet Science
             Electronics and Computer Science,
             University of Southampton,
             Southampton SO17 1BJ
Work: +44 23 8059 3670, Fax: +44 23 8059 3045
Mobile: +44 75 9533 4155 , Home: +44 23 8061 5652
http://www.ecs.soton.ac.uk/~hg/
Received on Sunday, 3 June 2012 20:26:00 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:40 UTC