W3C home > Mailing lists > Public > public-ar@w3.org > September 2011

Fwd: Multimedia User Interfaces: Update

From: ya knygar <knygar@gmail.com>
Date: Thu, 1 Sep 2011 23:17:17 +0000
Message-ID: <CAJVWO9Yjy5Qyr1kDFEH9q-nnPLewpQbUOFeOn5FOtqQ6Q8EJ8Q@mail.gmail.com>
To: W3C AR CG <public-ar@w3.org>
Regarding those multimedia user interface ideas, here are some
examples to clarify.

Regarding point one, a spatial selection is selecting a rectangle of a
video. By itself, a spatial selection is a subrectangle for the entire
multimedia object's duration:


A temporal selection is selecting, perhaps making use of the timeline,
a portion or interval of a multimedia object. By itself, a temporal
selection is for the entire movie's rectangle:


Combining those, selecting a rectangle of the multimedia object and an
interval of it, simultaneously, is a spatiotemporal selection:


Point two, or bookmarking, is about placing points of interest on a
multimedia object's timeline, for later use, without having to pause
the multimedia user experience.

Point three, observing the URI's for spatial, temporal and
spatiotemporal media fragments, is about starting from one of those,
as per <video src="http://example.com/
video.avi#xywh=160,120,320,240&t=10,20"/>, and being able to navigate
to either larger rectangles, wider intervals, both, or to the
video.avi object. Zooming also includes from a media fragment, such as
http://example.com/video.avi#t=10,20, to a containing structural
element, for example http://example.com/video.avi#id=chapter-1.

Point four, videos also have tracks and, in such tracks, are possible
structures beyond lists of chapters. Possible are structures like
books, parts, chapters, pages, paragraphs and sentences. It is
possible to select a structural element of a video.


With such structural tracks, people can traverse multimedia objects in
structure-based ways, as per from a point in a multimedia object to

The fifth idea, is about client-side text searching into videos.  Many
document viewers and web browsers provide searching into documents for
text occurances and that functionality is described as possible to
extend into the multimedia objects in those documents.  Client-side
text-based multimedia search can be facilitated by processing the
tracks that accompany multimedia objects, such as transcripts or
captions, and by audio and natural language processing techniques.

Kind regards,

Adam Sobieski
dev-platform mailing list
Received on Thursday, 1 September 2011 23:24:36 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 19:43:53 UTC