- From: Melvin Carvalho <melvincarvalho@gmail.com>
- Date: Mon, 4 Jun 2012 09:13:37 +0200
- To: Andreas Thalhammer <andreas.thalhammer@sti2.at>
- Cc: semantic-web@w3.org
- Message-ID: <CAKaEYhJZ7m0Bz_-UJ-Xzc5DXY6k8oMM9o3qxNDygXtQYaOpMZA@mail.gmail.com>
On 3 June 2012 16:05, Andreas Thalhammer <andreas.thalhammer@sti2.at> wrote: > Dear all, > > I want to take the opportunity to present my findings after analysing the > patterns of Google's Knowledge Graph. > > Each summary has a unique identifier. This identifier is used when linking > to other entities, e.g. > > H4sIAAAAAAAAAONgVuLQz9U3MKs0Lg**IAXXSnTQwAAAA > > stands for the summary of Steve Jobs. > > The URI to get the summary is > https://www.google.com/search?**hl=en&sa=X&q=steve+jobs&stick=** > H4sIAAAAAAAAAONgVuLQz9U3MKs0Lg**IAXXSnTQwAAAA<https://www.google.com/search?hl=en&sa=X&q=steve+jobs&stick=H4sIAAAAAAAAAONgVuLQz9U3MKs0LgIAXXSnTQwAAAA> > > I found out that this key is created with 2 tools, namely base64 and gzip. > > We can use this key to find out what the original content was (hoping to > find a link to Freebase). > > The way to go is the following: > > 1. Store the identifier above (H4sl..) in a file, e.g. id.b64 > 2. console:$ base64 -d id.b64 > id.gz > 3. console:$ gunzip id.gz > 4. console:$ cat id > > --> /m/06y3r > > Now, add the freebase namespace to that: > > http://rdf.freebase.com/ns/m/**06y3r <http://rdf.freebase.com/ns/m/06y3r> > > This redirects to: > > http://www.freebase.com/view/**en/steve_jobs<http://www.freebase.com/view/en/steve_jobs> > > Have a nice Sunday! > Nice find. I wonder what the motivation for this pattern is ... I tend to use sha1 in my hashes, but I guess that's lossy ... > > Andreas > > -- > Andreas Thalhammer > PhD Student > Semantic Technology Institute > University of Innsbruck > http://www.sti2.at/ > > phone: +43 (0) 512507 6454 > email: andreas.thalhammer@sti2.at > > >
Received on Monday, 4 June 2012 07:14:06 UTC