- From: Robert Sanderson <azaroth42@gmail.com>
- Date: Fri, 22 Feb 2013 09:49:32 -0700
- To: David Cuenca <dacuetu@gmail.com>
- Cc: "<public-openannotation@w3.org>" <public-openannotation@w3.org>
It seems like the thing that's the target of the Annotation/citation is really very different in the various cases. Character counting in a print book would be a nightmare, of course, and that's why page references are so obviously important. On the other hand, page references don't exist in some digital copies.

My suggestion, in Annotation speak, would be to have multiple targets with different selectors for the different expressions of the work. Then you could use systems appropriate for print with the print copies, and systems appropriate for digital with the digital copies, but the annotation/citation maintains the same identifier.

Otherwise just try to record as much information as possible, and let future systems sort it out as best they can :)

Rob

On Fri, Feb 22, 2013 at 6:52 AM, David Cuenca <dacuetu@gmail.com> wrote:
> On Fri, Feb 22, 2013 at 12:55 AM, Tom Morris <tfmorris@gmail.com> wrote:
>>
>> PG does all kinds of weird stuff. They insisted on 7-bit ASCII for ages
>> after everyone else moved to ISO Latin-1. They strip all edition
>> information claiming that they are creating new editions (which means none
>> of the citations would be any good anyway since you can't match them up with
>> the correct edition).
>>
>> If you look at the millions of PD books in the Internet Archive,
>> HathiTrust, Google Books, etc, you'll see that they certainly do include
>> page information. It's only the few thousand in the quirky Project Gutenberg
>> which don't (and even PG has that information at the beginning of the
>> process until they intentionally throw it away).
>
>
> It is not a PG issue only, there are many other digital libraries that don't
> signal page breaks or don't use any standard method to indicate it. Even in
> Wikisource there are many transcribed texts that do mention the edition but
> have no information about the pagination. One possible solution could be to
> have several scoping options (default: whole document, page number, CSS
> fragment, paragraph+delimiter, etc) and then use a finer text selection on
> that area (character count or quote selector).
>
> Btw, if anyone has a contact in PG, I'd love to talk with them.
>
> David
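
To make the two ideas in this thread a little more concrete (one annotation with several targets, each carrying a selector suited to its expression of the work, plus a coarse scope that is then refined by a quote), here is a minimal Python sketch. Everything in it is an assumption for illustration: the class names, the `kind` labels, the `urn:example:` identifiers and the `resolve_quote` helper are hypothetical and are not the Open Annotation vocabulary or any library's API.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Selector:
    # Hypothetical labels, e.g. "page", "char-range", "quote".
    kind: str
    value: dict


@dataclass
class Target:
    # Identifier of one expression of the work (print copy, digital copy, ...).
    source: str
    # Coarse scope (whole document if None), then an optional finer selection.
    scope: Optional[Selector] = None
    refinement: Optional[Selector] = None


@dataclass
class Annotation:
    # The identifier stays the same no matter how many targets are attached.
    identifier: str
    targets: List[Target] = field(default_factory=list)


def resolve_quote(text: str, exact: str, scope_start: int = 0,
                  scope_end: Optional[int] = None) -> Optional[Tuple[int, int]]:
    """Find `exact` inside text[scope_start:scope_end].

    Returns absolute (start, end) character offsets, or None if the quote
    does not occur entirely within the scope.
    """
    end = len(text) if scope_end is None else scope_end
    idx = text.find(exact, scope_start, end)
    if idx == -1:
        return None
    return (idx, idx + len(exact))


# One citation, two expressions of the same work (made-up identifiers).
citation = Annotation(
    identifier="urn:example:citation-1",
    targets=[
        # Print copy: a page reference is the natural selector.
        Target(source="urn:example:print-edition",
               scope=Selector("page", {"number": 1})),
        # Unpaginated digital copy: whole-document scope, refined by a quote.
        Target(source="urn:example:digital-transcription",
               refinement=Selector("quote", {"exact": "Some years ago"})),
    ],
)

sample_text = "Call me Ishmael. Some years ago, never mind how long precisely."
quote = citation.targets[1].refinement.value["exact"]
offsets = resolve_quote(sample_text, quote)
```

The quote lookup is deliberately naive (first exact match only); a real system would probably need prefix/suffix context or fuzzy matching to cope with differing transcriptions.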
Received on Friday, 22 February 2013 16:50:00 UTC