
Definition of "character" position for oa:start and oa:end

From: Mark Buer <mark.buer@booktrack.com>
Date: Mon, 22 Feb 2016 08:21:16 +1030
Message-ID: <CACVKchTP7rm37TjsAzS3MnDtNAuFGn50hS88k=xtdpCnkCEs=A@mail.gmail.com>
To: public-openannotation@w3.org
Is there ambiguity in the definition of "character" for a Text Position
Selector
<http://www.openannotation.org/spec/core/20130208/specific.html#TextPositionSelector>?

Are positions given as:

   - Unicode code points,
   - UTF-8 code units,
   - UTF-16 code units or
   - some other definition of "character"?


Interoperability between implementations may suffer if the definition of
"character" is not stated explicitly.
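
To illustrate how the candidate definitions can diverge, here is a minimal
Python sketch. The helper name `offsets` and the sample string are
hypothetical, chosen only for illustration; the sketch computes the start
position of the same substring under three of the definitions listed above.

```python
def offsets(text: str, needle: str) -> dict[str, int]:
    """Start offset of `needle` in `text` under three definitions of "character"."""
    # Python strings index by Unicode code point, so slicing gives the prefix
    # that precedes the first occurrence of `needle`.
    prefix = text[: text.index(needle)]
    return {
        # One unit per Unicode code point.
        "code_points": len(prefix),
        # One unit per byte of the UTF-8 encoding.
        "utf8_code_units": len(prefix.encode("utf-8")),
        # One unit per 16-bit code unit (two bytes each, no BOM with "-le").
        "utf16_code_units": len(prefix.encode("utf-16-le")) // 2,
    }


# Hypothetical sample: "naïve", an emoji outside the Basic Multilingual
# Plane, then the word being selected.
result = offsets("na\u00efve \U0001F600 text", "text")
print(result)
```

For this sample the three counts are 8, 12 and 9 respectively, so two
implementations that disagree on the definition would anchor the same
oa:start at different places in the text.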


I recently raised a similar issue
<https://github.com/IDPF/epub-revision/issues/555> (with some useful links
to other standards) against the epubcfi specification
<http://www.idpf.org/epub/linking/cfi/epub-cfi.html>. The resolution in
that case was "UTF-16 code units", which is unsurprising given the
web-technology nature of that specification.
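
If positions were defined as UTF-16 code units, an implementation whose
native strings are indexed by code point would need a conversion step. The
following is a rough sketch under that assumption only; the function name
`utf16_offset_to_code_point_offset` is hypothetical and nothing here is taken
from either specification.

```python
def utf16_offset_to_code_point_offset(text: str, utf16_offset: int) -> int:
    """Map an offset counted in UTF-16 code units to a code point index."""
    units = 0
    for index, ch in enumerate(text):
        if units == utf16_offset:
            return index
        # Code points above U+FFFF occupy two UTF-16 code units
        # (a surrogate pair); all others occupy one.
        units += 2 if ord(ch) > 0xFFFF else 1
        if units > utf16_offset:
            raise ValueError("offset falls inside a surrogate pair")
    if units == utf16_offset:
        return len(text)
    raise ValueError("offset is past the end of the text")


# "a", an emoji outside the Basic Multilingual Plane, then "b".
sample = "a\U0001F600b"
print(utf16_offset_to_code_point_offset(sample, 3))  # index of "b"
```

Rejecting an offset that lands between the two halves of a surrogate pair is
a design choice made for this sketch; a specification could equally require
rounding to the nearest code point boundary.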
Received on Monday, 22 February 2016 11:21:28 UTC
