- From: Mark Fallu <m.fallu@griffith.edu.au>
- Date: Sat, 11 Oct 2014 13:03:02 +1000
- To: Phillip Lord <phillip.lord@newcastle.ac.uk>
- Cc: Simon Spero <sesuncedu@gmail.com>, "Gray, Alasdair" <A.J.G.Gray@hw.ac.uk>, "<semantic-web@w3.org>" <semantic-web@w3.org>, "Peter F. Patel-Schneider" <pfpschneider@gmail.com>, Mark Diggory <mdiggory@gmail.com>, W3C LOD Mailing List <public-lod@w3.org>
Hi Phil, Nice work on Greycite - it looks like a very useful utility. Is the sourcecode for Greycite available? Cheers, Mark Sent from my iPhone > On 9 Oct 2014, at 9:56 pm, Phillip Lord <phillip.lord@newcastle.ac.uk> wrote: > > > > > Simon Spero <sesuncedu@gmail.com> writes: > >>> On Oct 8, 2014 10:15 AM, "Gray, Alasdair" <A.J.G.Gray@hw.ac.uk> wrote: >>> >>> Or is that because they want to import it into their own reference >> management system, e.g. Mendeley, which does not support the HTML version? >> >> 1. It is quite easy to embedded metadata in HTML pages in forms designed >> for accurate importing into reference managers (Hellman 2009). Mendeley has >> been known to have problems with imports in cases where a proxy server is >> involved. > > Myself and Lindsay Marshall have done a fair amount of work extracing > metadata from HTML for purposes of citation. With a fair amount of > heuristics, we can get enough metadata for a full citation from about > 60% of what you might call serious websites (i.e. those with technical > content). The general web is lower (about 1%) but most of the web > appears to be chinese pornography. > > This is available as a tool at http://greycite.knowledgeblog.org/. > > And fuller description is available at http://arxiv.org/abs/1304.7151. > > Phil >
Received on Saturday, 11 October 2014 03:03:32 UTC