W3C home > Mailing lists > Public > public-lod@w3.org > May 2013

Re: Final CFP: In-Use Track ISWC 2013

From: Sebastian Hellmann <hellmann@informatik.uni-leipzig.de>
Date: Thu, 02 May 2013 17:21:21 +0200
Message-ID: <51828471.8090205@informatik.uni-leipzig.de>
To: Paul Groth <p.t.groth@vu.nl>
CC: Sarven Capadisli <info@csarven.ca>, Linking Open Data <public-lod@w3.org>
Hi Paul,
personally for me latex works best, because it has good editors and 
support for description logic formulas. Plus it is widely used and quite 
good for PDF typesetting.

It would be really swell to be able to address content within PDF with 
identifiers. Did Utopia solve that problem?

I am asking along the lines of
- mediafragments [1]
- RFC 5147 text fragment identifier (see the example at the bottom of [2])
- xpointer/xpath [3]

If yes, I would like to use it immediately. There are plans to convert 
the Google Mention corpus (which includes PDF's) to NIF [2] .
The PDF Open Parameters provided by [4] are way too simple.

All the best,
Sebastian

[1] http://www.w3.org/TR/media-frags/
[2] (example is at the bottom of .ttl file) 
http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core
[3] e.g. 
http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/span[1]/text()[1])
[4] 
http://partners.adobe.com/public/developer/en/acrobat/PDFOpenParameters.pdf#page=7

Am 02.05.2013 12:55, schrieb Paul Groth:
> Hi Sarven,
>
> Beyond the PDF for me is moving beyond the current research 
> communication system as highlighted by the Force 11 manifesto 
> (http://www.force11.org/white_paper). This includes adopting 
> technologies that augment/extend (i.e. go beyond) existing 
> technologies. For example, making data easily accessible and citable, 
> providing links to online content, making multiple perspectives on 
> content available, exposing provenance, using altmetrics. I'm very 
> influenced by the work on Utopia (http://utopiadocs.com) so that's why 
> I think using pdfs are fine - you can do a lot with them as they stand 
> - and for a certain form of communication (written long form text) 
> they work well. As technologist we need to make sure that these new 
> technologies work well in the environment and connect to other things.
>
> cheers
> Paul
>
>
>
>
>
>
>
>
> On Thu, May 2, 2013 at 12:32 PM, Sarven Capadisli <info@csarven.ca 
> <mailto:info@csarven.ca>> wrote:
>
>     On 05/02/2013 12:23 PM, Paul Groth wrote:
>
>         I think Harry makes the point better than I can.
>
>
>     Paul, I have one last question for you if you don't mind, because
>     it seems like you are not interested in playing this out and I
>     don't want to bother you further: what does "beyond the PDF" mean
>     to you?
>
>     -Sarven
>
>
>
>
> -- 
> -----------------------------------------------------------------------------------
> Dr. Paul Groth (p.t.groth@vu.nl <mailto:p.t.groth@vu.nl>)
> http://www.few.vu.nl/~pgroth/ <http://www.few.vu.nl/%7Epgroth/>
> Assistant Professor
> - Web & Media Group | Department of Computer Science
> - The Network Institute
> VU University Amsterdam


-- 
Dipl. Inf. Sebastian Hellmann
Department of Computer Science, University of Leipzig
Events: NLP & DBpedia 2013 (http://nlp-dbpedia2013.blogs.aksw.org, 
Deadline: *July 8th*)
Venha para a Alemanha como PhD: http://bis.informatik.uni-leipzig.de/csf
Projects: http://nlp2rdf.org , http://linguistics.okfn.org , 
http://dbpedia.org/Wiktionary , http://dbpedia.org
Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
Research Group: http://aksw.org
Received on Thursday, 2 May 2013 15:22:00 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:21:44 UTC