W3C home > Mailing lists > Public > semantic-web@w3.org > October 2014

Re: scientific publishing process (was Re: Cost and access)

From: Peter F. Patel-Schneider <pfpschneider@gmail.com>
Date: Mon, 06 Oct 2014 18:31:18 -0700
Message-ID: <54334266.2080001@gmail.com>
To: Kingsley Idehen <kidehen@openlinksw.com>, "semantic-web@w3.org" <semantic-web@w3.org>
On 10/06/2014 06:19 PM, Kingsley Idehen wrote:
> On 10/6/14 2:49 PM, Peter F. Patel-Schneider wrote:
>>
>>
>> On 10/06/2014 11:03 AM, Kingsley Idehen wrote:
>>> On 10/6/14 12:48 PM, Peter F. Patel-Schneider wrote:
>>>> It's not hard to query PDFs with SPARQL.  All you have to do is extract the
>>>> metadata from the document and turn it into RDF, if needed. Lots of programs
>>>> extract and display this metadata already.
>>>
>>> Peter,
>>>
>>> Having had 200+ (some-non-rdf-doc} to RDF document transformers built under my
>>> direct guidance, there are issues with your claim above:
>>
>> Huh?  Every single PDF reader that I use can extract the PDF metadata and
>> display it.
>
> Again, this isn't about metadata.

Well, my wording at the beginning of this exchange is about metadata.

>> The metadata that I see in PDF documents uses a core set of properties that
>> are easy to transform into RDF.
>
> Metadata isn't the issue at hand.

Look at the initial quote of this email message.
>
>> Of course, this core set is very small (title, author, and a few other
>> things) so you don't get all that much out of the core set.
>
> See my comments above



peter
Received on Tuesday, 7 October 2014 01:31:55 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 19:49:25 UTC