- From: Peter F. Patel-Schneider <pfpschneider@gmail.com>
- Date: Mon, 06 Oct 2014 18:31:18 -0700
- To: Kingsley Idehen <kidehen@openlinksw.com>, "semantic-web@w3.org" <semantic-web@w3.org>
On 10/06/2014 06:19 PM, Kingsley Idehen wrote: > On 10/6/14 2:49 PM, Peter F. Patel-Schneider wrote: >> >> >> On 10/06/2014 11:03 AM, Kingsley Idehen wrote: >>> On 10/6/14 12:48 PM, Peter F. Patel-Schneider wrote: >>>> It's not hard to query PDFs with SPARQL. All you have to do is extract the >>>> metadata from the document and turn it into RDF, if needed. Lots of programs >>>> extract and display this metadata already. >>> >>> Peter, >>> >>> Having had 200+ (some-non-rdf-doc} to RDF document transformers built under my >>> direct guidance, there are issues with your claim above: >> >> Huh? Every single PDF reader that I use can extract the PDF metadata and >> display it. > > Again, this isn't about metadata. Well, my wording at the beginning of this exchange is about metadata. >> The metadata that I see in PDF documents uses a core set of properties that >> are easy to transform into RDF. > > Metadata isn't the issue at hand. Look at the initial quote of this email message. > >> Of course, this core set is very small (title, author, and a few other >> things) so you don't get all that much out of the core set. > > See my comments above peter
Received on Tuesday, 7 October 2014 01:31:55 UTC