- From: Benjamin Young <byoung@bigbluehat.com>
- Date: Mon, 15 Oct 2018 20:33:02 +0000
- To: Christopher Blackwell <cwblackwell@gmail.com>, "public-openannotation@w3.org" <public-openannotation@w3.org>
- CC: Steven Harms <sgharms@stevengharms.com>
- Message-ID: <BN6PR06MB27700364053AEB3FCD25F28CB2FD0@BN6PR06MB2770.namprd06.prod.outlook.com>
It's likely best--given the vast array of options--that one store as many matching target expressions as one is able to generate at the time the annotation is recorded (or perhaps later with machines). As in... ``` { "target": [ "urn:cts:....", { "source": "urn:isbn:...", "selector": { ...some nifty new selector for physical dimensions, pages, etc... } ] } ``` There's also EPUB CFI's of course...and likely many more we've missed... >_> As this exploration goes along, if anyone wants to write these findings up on the wiki, that'd be super amazing: https://www.w3.org/community/openannotation/wiki/Main_Page Cheers! Benjamin -- http://bigbluehat.com/ http://linkedin.com/in/benjaminyoung ________________________________ From: Christopher Blackwell <cwblackwell@gmail.com> Sent: Saturday, October 13, 2018 5:39 PM To: public-openannotation@w3.org Cc: Steven Harms Subject: Re: Web annotations for physical texts Hi Steven, Some thoughts on your questions… CTS URNs are for machine-actionable identification and retrieval of passages of text, so their job really is different from that of a human-readable label. In our projects we use the plain text CEX format ( https://cite-architecture.github.io/citedx/CEX-spec-3.0.1/ ) for capturing data and loading it into services, and it is at that level that we can attach human-readable labels to works and editions. Here’s a link that will (after a short delay, the server seems a little slow today) deliver a passage of text, with a label attached (and linked commentaries and some other stuff): http://www.homermultitext.org/hmt-digital/index.html?urn=urn:cts:greekLit:tlg0012.tlg001.msA:1.1-1.5 As for citing a page of a book… CTS really is about _texts_ rather than _books_. A CTS-URN captures the semantics of a “text” defined as “an ordered hierarchy of citation objects”. For our texts, at least, pages in a physical edition constitute a structure orthogonal to the citation-hieararchy of a work. So I don’t think there is a low-friction way to bend CTS away from canonically citable (= citations independent of any particular expression of a text) texts to texts citable only by pages in a particular printed edition. We associate CTS texts with “pages”, but it involves quite a bit of integration. This might be way more than you want to get into, but to give an example… http://www.homermultitext.org/hmt-digital/index.html?urn=urn:cite2:hmt:msA.v1:12r The above is a URL that will display an object in an ordered collection of manuscript folios; "urn:cite2:hmt:msA.v1:12r” identifies folio 12-recto of a physical manuscript. And this is a record that identifies a graph of (a) a passage of text (CtsUrn), (b) a physical folio (Cite2Urn), and (c) a digital image mapping the passage on the folio: http://www.homermultitext.org/hmt-digital/index.html?urn=urn:cite2:hmt:va_dse.v1:il10 Cheers, Chris B. -- Christopher W. Blackwell The Louis G. Forgione University Professor Department of Classics Furman University On Oct 13, 2018, at 4:14 AM, Steven Harms <sgharms@stevengharms.com> wrote: Given two endorsements for CTS in short order, I read the description and it seemed intuitive and to cover the required specificity easily. As such: urn:cts:CTSNAMESPACE:WORK:PASSAGE@SUBREFERENCE Would become urn:cts:isbn:###:<PASSAGE> Pros: 1. Intuitive! Cons: 1. With ISBN we lose the human friendliness of say, “JK Rowling wrote HP&Philospher’s stone.” This can be remedied, of course, by a higher container holding human-friendly data, but it seems like an obvious nit to address. MLA and other citation schemes preserve this visibly in the citation. Question: 1. How to handle <PASSAGE> in a book? Pasting the full text seems onerous. To annotate passage p, I don’t want to have to type in passage p *and* my annotation. This would also set one afoul of copyright holders. Further, range offsets, while completely reasonable are not given generally outside of epic poetry or other classics. Certainly many e-readers make this calculation possible and that will surely be the correct scheme for annotations from that medium. However, my focus remains real books ;) The most common scheme for a popular book would be the page. The docs state, failing an offset: > A reference to an individual passage is formatted as dot-separated components representing one or more levels of the citation hierarchy defined in a CTS TextInventory for that work. Now for most popular works, there is no CTS TextInventory — to the best of my knowledge. So: is there a low-friction way to refer to a page? Thanks for the suggestions to now, Steven (Typos and blunders my own as i’m On vacation without access to a keyboard ;)) On Thu, Oct 11, 2018 at 3:54 AM Christopher Blackwell <cwblackwell@gmail.com<mailto:cwblackwell@gmail.com>> wrote: Dear Steven, The CTS URN might be helpful: http://cite-architecture.github.io/ctsurn/ Part of the CITE Architecture: http://cite-architecture.github.io<http://cite-architecture.github.io/> (Disclosure: This is a thing I’ve worked on over the years.) This blog post points to some live examples of real data integrated with CTS URNs: http://homermultitext.blogspot.com/2018/07/the-homer-multitext-microservice-homer.html If this looks at all interesting, please don’t hesitate to send along further questions. Cheers, Chris B. -- Christopher W. Blackwell The Louis G. Forgione University Professor Department of Classics Furman University On Oct 10, 2018, at 1:57 PM, Steven Harms <sgharms@stevengharms.com<mailto:sgharms@stevengharms.com>> wrote: Greetings, I am interested in creating annotations on physical books [1<https://stevengharms.com/research/semweb-topic/problem_statement/>]. As the name "web annotations" suggests, the default target of the Web Annotation Working Group would be, of course, to annotation IRI-referable targets with IRI-identifiable Annotations. 1. Is there a model whereby we could point to a physical resource in a URI / IRI format (and thus join the existing Web Annotation universe, *or* 2. Is there a framework that might support referring to physical books that I've simply not found 3. Or should I plan to use JSON-LD to create "forge my own path?" I hope to post an example of what #3 might look like, but I'd like to double check my understanding before engaging in in such an effort, tabula rasa. Regards, Steven [1]: https://stevengharms.com/research/semweb-topic/problem_statement/ -- Steven G. Harms PGP: E6052DAF<https://pgp.mit.edu/pks/lookup?op=get&search=0x337AF45BE6052DAF> -- Steven G. Harms PGP: E6052DAF<https://pgp.mit.edu/pks/lookup?op=get&search=0x337AF45BE6052DAF>
Received on Monday, 15 October 2018 20:33:30 UTC