- From: Ivan Herman <ivan@w3.org>
- Date: Sun, 19 Apr 2015 09:58:24 +0200
- To: Liam Quin <liam@w3.org>
- Cc: Randall Leeds <randall@bleeds.info>, Robert Sanderson <azaroth42@gmail.com>, Paolo Ciccarese <paolo.ciccarese@gmail.com>, W3C Public Annotation List <public-annotation@w3.org>, Bill Kasdorf <bkasdorf@apexcovantage.com>, Tzviya Siegman <tsiegman@wiley.com>, Markus Gylling <markus.gylling@gmail.com>
- Message-Id: <368BB21B-A303-493C-8F00-D05162A4C5E3@w3.org>
> On 18 Apr 2015, at 20:34 , Liam R. E. Quin <liam@w3.org> wrote: > > On Tue, 2015-04-14 at 11:38 +0200, Ivan Herman wrote: >>> On 13 Apr 2015, at 23:05 , Randall Leeds <randall@bleeds.info> >>> wrote: >>> >>> Rob's answer is much better than mine, but points to the same >>> solution, I think. >>> >>> If you control the media type and the meaning of fragments in that >>> context, then you don't need our permission to put an OA selector >>> in the fragment. >> >> I am not looking for a permission, I am looking for a coordination:-) > > > URI fragments are defined by media type registrations based on media > type. For example, the fragment identifier syntax for XML documents is > defined to be XPointer. > > This is arguably bad architecture, because it fails on content > negotiation. So when someone asked on the dpub call I probably should > have said "no, you can't use a URI for this". > > You can, however, invert it: > annotations://annotation-server.example.org/?uri=yyy;xpath=/book/chapter[3]//figure[@src='me.jpg']/ancestor::para/wordspan(5,17);mode- > highlight > > The annotation server could issue a redirect -- but a client-side > engine could simply rewrite this to an annotated URI (yyy). > > So I think there's scope for creativity. The discussion, if fact, originated from outside the annotation work; it is back to the web packaging[1] work that may become the new packaging format for future electronic document like ebooks. If the web packaging is adopted. The fragment identification in that document is a bit similar to what you describe, it would have something like http://example.org/downloads/editor.pack#url=/root.html;fragment=something_complex_here where the 'url' part of the fragment identifies a 'part' within a package (there are other parameters to help in filtering among various parts) and the value of the 'fragment' is, well, a fragment identifier *within* the identified part, according to the media type of that header. So the remark of Rob on the thread is justified: if we turn a selector into a fragment, it should be accepted/acceptable for a specific media type (which I believe is not an unsurmountable obstacle). Ivan [1] http://www.w3.org/TR/web-packaging/ [2] http://www.w3.org/TR/web-packaging/#fragment-identifiers > > >> Sure, we can define those URI fragments. Is it o.k. if a group just >> does that without coordinating with those who are behind the OA >> Selector model? I do not think so… >> >> Actually, I also wonder whether the serialization of the selectors >> in terms of URI-s would not come handy to our own deliverables, too >> (again, with possible restrictions as for the media types). Eg, >> handling URI-s that way with existing URI libraries in the various >> programming languages around us might be handy… (But, I admit, I am >> just handwaving here…) >> >> Ivan >> >> >>> >>> >>> On Mon, Apr 13, 2015, 11:04 Robert Sanderson <azaroth42@gmail.com> >>> wrote: >>> On Mon, Apr 13, 2015 at 10:07 AM, Ivan Herman <ivan@w3.org> wrote: >>> Hm. The problem is that there is a use case here that we may have >>> to accommodate somehow. >>> At the moment, if you take an Ebook, and you want to have a URI >>> identifying a specific position within a specific chapter of a >>> book, what you can use something like: >>> >>> http://www.example.org/book#epubcfi(/6/4[chap01ref]!/4[body01]/10[para05]/3:10) >>> >>> >>> Sure, because the Epub media type registration defines the meaning >>> of fragment section of URIs where Epub is a representation that >>> can be retrieved. >>> We can't legitimately just add #oa:Selector(...) to the end of the >>> URI instead. >>> >>> >>> epubcfi works, and is used, but it has its drawbacks (let me not >>> go into all the details). One drawback is that what it offers as >>> anchoring possibility though powerful) is way less flexible than >>> the selector model, primarily the range selectors. The conceptual >>> model behind those would become useful, as an alternative to >>> something like epubcfi, if those structures could be used as >>> fragments. >>> >>> Agreed, and the same applies for every other media type as well. >>> >>> >>> Maybe we have to restrict its usage and define it only for >>> specific media types (text, etc) to avoid the issues in your >>> example on genetic sequences or full blown graphics. But believe >>> something like that would be very useful and, for some >>> communities, necessary. >>> >>> The point from 3986 (and related) is that we _cannot_ define it >>> for specific media types unless we control them. It's summarized >>> in the first bullet in the annotation spec I linked to. >>> For example, the meaning of a fragment on a plain text document is >>> defined by 5147: https://tools.ietf.org/html/rfc5147 >>> >>> So we can't just say that people should use #oa:Selector(...) with >>> a plain text document (or any other format) :( >>> >>> Rob >>> >>> >>> >>> (b.t.w., I am not sure I understand your comment on RFC3986) >>> [1] http://www.idpf.org/epub/linking/cfi/epub-cfi.html >>> >>>> On 13 Apr 2015, at 18:28 , Robert Sanderson <azaroth42@gmail.com >>>>> wrote: >>>> >>>> >>>> We discussed fragments in the community group at length. >>>> >>>> The concerns about the approach are documented here: >>>> http://www.w3.org/TR/annotation-model/#fragment-uris >>>> >>>> These boil down to the fact that as you get more sophisticated >>>> selections the URI becomes unbearably long. >>>> Consider serializing an entire SVG document into the URI to >>>> specify a non rectangular area. Or selecting the previous and >>>> following 1024 Gs Cs As and Ts to select a range of text in a >>>> genetic sequence. >>>> >>>> My personal position is that selectors should not be turned into >>>> fragments, because (especially) that would break the rules of >>>> fragment identifiers as laid out in RFC 3986: >>>> >>>> The semantics of a fragment identifier are defined by the set of >>>> representations that might result from a retrieval action on the >>>> primary resource. >>>> As further discussed by JeniT here: >>>> http://www.w3.org/TR/fragid-best-practices/ >>>> >>>> Basically, unless there's a new text/HTML RFC that allows us to >>>> do it, we can't arbitrarily shove the description of the segment >>>> into its identity. >>>> >>>> Rob >>>> >>>> >>>> On Mon, Apr 13, 2015 at 9:10 AM, Ivan Herman <ivan@w3.org> >>>> wrote: (Although this may not be immediately relevant to the >>>> Working Group right now, I think the question *may* become >>>> relevant, hence my copy to it…) >>>> >>>> Rob, Paolo, >>>> >>>> a question came up at the Digital Publishing IG today. The IG is >>>> looking at general fragment identifiers for the purpose of >>>> identifying portions within a digital document (typically EPUB, >>>> but also some future versions of it). The Selector structure of >>>> the OA obviously gives a great model for various types of >>>> anchors, mainly when combined with other, existing fragment id >>>> definitions. >>>> >>>> However, at present, the selectors are defined in terms of RDF >>>> resources; to take an example from the spec, it says, for example >>>> >>>> selector": { >>>> "@id": "http://example.org/selector1", >>>> "@type": "oa:DataPositionSelector", >>>> "start": 4096, >>>> "end": 4104 >>>> } >>>> >>>> To be usable for a fragment identification, this structure >>>> should be turned into some sort of a, well, URI fragment. I >>>> mean, it is probably relatively easy to do this, something like >>>> >>>> http://www.example.org/#selector(type=DataPositionSelector,start=4096,end=4104) >>>> >>>> >>>> would do it but, of course, the ideal would be if that type of >>>> fragment format would be defined at one place. >>>> >>>> The question is: has this ever been discussed previously on the >>>> OA model? If it hasn't been done, should it be done? If it >>>> should be done, should it be done by this WG, or some other >>>> group? >>>> >>>> Thanks >>>> >>>> Ivan >>>> >>>> >>>> ---- >>>> Ivan Herman, W3C >>>> Digital Publishing Activity Lead >>>> Home: http://www.w3.org/People/Ivan/ >>>> mobile: +31-641044153 >>>> ORCID ID: http://orcid.org/0000-0003-0782-2704 >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> -- >>>> Rob Sanderson >>>> Information Standards Advocate >>>> Digital Library Systems and Services >>>> Stanford, CA 94305 >>> >>> >>> ---- >>> Ivan Herman, W3C >>> Digital Publishing Activity Lead >>> Home: http://www.w3.org/People/Ivan/ >>> mobile: +31-641044153 >>> ORCID ID: http://orcid.org/0000-0003-0782-2704 >>> >>> >>> >>> >>> >>> >>> >>> -- >>> Rob Sanderson >>> Information Standards Advocate >>> Digital Library Systems and Services >>> Stanford, CA 94305 >> >> >> ---- >> Ivan Herman, W3C >> Digital Publishing Activity Lead >> Home: http://www.w3.org/People/Ivan/ >> mobile: +31-641044153 >> ORCID ID: http://orcid.org/0000-0003-0782-2704 >> >> >> >> > ---- Ivan Herman, W3C Digital Publishing Activity Lead Home: http://www.w3.org/People/Ivan/ mobile: +31-641044153 ORCID ID: http://orcid.org/0000-0003-0782-2704
Received on Sunday, 19 April 2015 07:58:42 UTC