W3C home > Mailing lists > Public > public-tt@w3.org > June 2003

Re: Annotation of Timed Text

From: John Hanna <j.hanna@snet.net>
Date: Fri, 27 Jun 2003 11:20:02 -0400
Message-ID: <00fa01c33cbf$9966ff30$0300a8c0@VAIOZ505JSK>
To: "Dave Singer" <singer@apple.com>, "Glenn A. Adams" <glenn@xfsi.com>, <public-tt@w3.org>

It seems that "annotation" and "meta-data" have quite a conceptual overlap,
but are not the same in current usage.

Annotations are included in XML Schema as full elements. TT Requirements
include meta-data elements that could be used for annotation, but it seems
that some annotations might not fit that model, for example, if an
annotation didn't reference a single element, but needed to say "element 1
doesn't agree with element 7". That could be shifted to an "external" form
of annotation document that just points to the multiple TT document contents
elements, but given that there wouldn't be reverse pointers, would it be
possible to display that external annotation material simultaneously in a
timed manner? If there was some sort of dynamic formation of a combined
document for timed display, wouldn't the structure need to support
distinctions between content and annotation anyway?

I note that MPEG-7 is mentioned in an editorial comment in TT Requirements.
Doesn't MPEG-7 do much of what Timed Text will do? It seems to address
annotation also. Possibly there should be more connection between the two.

John Hanna

----- Original Message -----
From: "Dave Singer" <singer@apple.com>
Sent: Thursday, June 26, 2003 12:13 PM
Subject: Re: Annotation of Timed Text

> At 12:06 -0400 6/25/03, John Hanna wrote:
> >On Tuesday, June 24, 2003 4:02 PM Glenn A. Adams said...
> >>  We haven't discussed annotating. Could you give some concrete
> >>  use case scenarios?
> Isn't "annotation" another word for "meta-data" or do you see these
> as distinct?  It seems that many domains may need domain-specific
> data associated with the timed-text.
> >
> >Consider the use of digital video (including audio) for research and
> >education. A video is observed and interpreted from a particular
> >perspective, and a Timed Text record made of the interpretation. The
> >could be a complex structure, akin to descriptive captioning, with
> >transcriptions of dialog, descriptions of movement and events, etc. This
> >would be displayed in a browser or media viewer panel synchronized with
> >video. For research, review comments about the interpretation would
> >the record and be presented in the timed display. For Education,
> >supplementary hints about how to view the scenes and pick out the
> >aspects, or even instructor notes, would annotate the record and
> >be presented in the timed display. Of course the record and annotations
> >could also be viewed in a non-timed manner. To emphasize the video
> >associated with the timed text, an overlay of tracking highlights could
> >video objects and be keyed (e.g., by color) to the text.
> >
> >Another example would be for an animated storyboard or movie, with the
> >screenplay presented in timed text (although there may not be a
> >DTD/schema yet). Annotations would pertain to hints or comments for all
> >different production roles, or for cinema studies.
> >
> >From a human factors perspective, having labels and descriptions closer
> >more directly associated with the video objects would be best. Possibly
> >timed text dialog should be displayable in a talk-balloon that moves with
> >the speaker. Is that within the scope of ways timed text can be
> >or is it moving into animation?
Received on Friday, 27 June 2003 11:12:44 UTC

This archive was generated by hypermail 2.3.1 : Thursday, 5 October 2017 18:23:59 UTC