W3C home > Mailing lists > Public > public-openannotation@w3.org > February 2013

Re: Floating Quotable Citations (FQC)

From: David Cuenca <dacuetu@gmail.com>
Date: Wed, 20 Feb 2013 21:01:53 -0500
Message-ID: <CAJBSGSoefmoxZukB3Csmw95DEbxvtM5tG4gX4=4PuF8DPmhx2A@mail.gmail.com>
To: Robert Sanderson <azaroth42@gmail.com>
Cc: public-openannotation@w3.org
On Wed, Feb 20, 2013 at 1:43 PM, Robert Sanderson <azaroth42@gmail.com>wrote:

> However I'd encourage you to *not* try to do this as a URI Fragment,
> as you would be competing with the official specifications of what a
> Fragment component of HTML, plain text, XML and PDF resources means.
> Within Media Wiki and other conforming implementations you can, of
> course, use the query approach.
>

The idea was that, if seen appropriate, it could be converted into an
official specification.
That is for sure not up to me to decide, but it is path that could be
explored.


> Some other issues off the top of my head:
>
> * It's hard to determine paragraphs, sentences and words.
> -- Paragraphs could be <p>, or <div>, but they might not be.  Perhaps
> just <br/><br/> is used to separate the paragraphs.
> And that's just HTML, let alone other textual resources.
> -- Sentences:  Mr. J. Smith of the U.S.A. took $1.45 from his pocket
> ... and spent it.   1 sentence or 10?
> -- Words: Word splitting is extremely hard in eastern languages.
>

In the first case it is a matter of having accurate definitions of what a
paragraph means - I admit there are some loose ends there. The issue with
sentences could be averted defining a sentence as a group of at least 2
words and handling numerals properly. Nevertheless in an extreme situation
like that it would be much more sensible to use word counting instead. What
do you mean by word splitting in eastern languages? The concept of using
the unit "word"?

* We stuck with character counting, but even then it's tricky with
> normalization routines.  &amp;  -- 1 character or 5?
> You have the same issue with length as well.
>

My aim was only to address the topic of parsed text, would that be an issue
in that case?

Hope that helps!
>

I does help a lot! As I've said this is still in early stages so criticism
will make the framework stronger and it will properly define the boundaries
of how far can we get with this.
Thanks Rob for spending your time reviewing this proposal!

David
Received on Thursday, 21 February 2013 02:02:41 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:22:03 UTC