W3C home > Mailing lists > Public > public-openannotation@w3.org > February 2013

Re: Floating Quotable Citations (FQC)

From: David Cuenca <dacuetu@gmail.com>
Date: Fri, 22 Feb 2013 00:32:14 -0500
Message-ID: <CAJBSGSptknAdwgQkR6fZk=Yw7DuO_RHQ6ksGwHBKVAnVoSpmoA@mail.gmail.com>
To: Tom Morris <tfmorris@gmail.com>
Cc: Karin Verspoor <Karin.Verspoor@nicta.com.au>, Robert Sanderson <azaroth42@gmail.com>, "<public-openannotation@w3.org>" <public-openannotation@w3.org>
On Fri, Feb 22, 2013 at 12:00 AM, Tom Morris <tfmorris@gmail.com> wrote:

>
> I think you're trying to do too much.  The standard in the paper world is
> a page number or range of page numbers (of a specific edition, I might
> add).  The page boundaries are preserved in the scanning process and
> certainly can be preserved in the OCR and post-processing.  The citation
> might not be letter accurate, but it will be as accurate as the original
> paper-world citation.
>
> One way people tighten the citation up is to quote a passage.  In that
> case, the quote can be matched against the newly OCR'd text.
>
> You're reaching for goal that may not be achievable.  I'd suggest
> simplifying.
>
>
Well, I suggested an ideal scenario that might be far-fetched. The
immediate goal is to be able to generate quote queries.
And I wish that page boundaries were always preserved, but if you take a
look to the transcriptions by Project Gutenberg you will see that they
aren't.

David
Received on Friday, 22 February 2013 05:33:01 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:38:22 UTC