- From: Glenn A. Adams <glenn@xfsi.com>
- Date: Wed, 29 Jan 2003 20:03:38 -0500
- To: "W3C TT Public" <public-tt@w3.org>
I would like to engender a discussion on the issue of the contexts
where timed text (TT) content may be used and how this will affect the
buffering and timing aspects of this content. If this was discussed in
detail in the task force, please let me know. Otherwise, perhaps we can
discuss it a bit here, since I see some need for clarity (at least in
my own mind) regarding the various high-level usage scenarios.
I envision TT being used in a number of different contexts, including:
* authoring systems, for purpose of having a common interchange format
amongst these systems;
* storage systems, for purpose of having a common static media format,
e.g., on DVD, VHS, CD, etc.;
* traditional distribution systems, such as those that deliver V/A/D
(video + audio + data) content or A/D (audio + data) from content
sources to terrestrial or cable or satellite station operators;
* traditional emission systems, such as from terrestrial, cable,
satellite operators to end-user receivers and terminal devices;
* online distribution systems, such as HTTP or RTP over Internet;
Among these various contexts, there appear to be a number of distinct
buffering and timing models:
(1) NON-STREAMING, SYNCHRONOUS: timing information is explicit, and not
(necessarily) intended to be synchronized with any other media;
Example: RealText, QuickText, etc., where it is the only media
track or is asynchronous with respect to other tracks, and where
it is pre-delivered as a single resource.
(2) NON-STREAMING, SYNCHRONIZED: timing information is explicit, and is
explicitly linked to a time base in another media;
Example: RealText, QuickText, etc., where it is the only media
track or is asynchronous with respect to other tracks, and where
it is pre-delivered as a single resource.
(3) STREAMING, SYNCHRONOUS: timing information is explicit, and not
(necessarily) intended to be synchronized with any other media;
Example: use of an MPEG PES (packetized elementary stream) with
presentation time stamps to deliver synchronized data, where each
data access unit is associated with a presentation time stamp
locked to a program clock reference carried in that stream.
(4) STREAMING, SYNCHRONIZED: timing information is explicit, and is
explicitly linked to a time base in another media;
Example: use of an MPEG PES (packetized elementary stream) with
presentation time stamps to deliver synchronized data, where each
data access unit is associated with a presentation time stamp
locked to a program clock reference carried elsewhere.
(5) STREAMING, ISOCHRONOUS: timing information is implicit,
and is determined by the transport or the envelope;
Example: use of SMPTE 292M with ancillary data to carry closed
captioning.
Example: use of NTSC/PAL VBI to carry EIA-608 or MPEG-2 Video
Elementary Stream User Data to carry EIA-708.
So, I have some questions:
1. Is the above characterization of streaming vs. non-streaming and
synchronous vs. synchronized vs. isochronous a correct and complete
characterization of the various usage contexts? [Note that I am
only looking at buffering and timing aspects, and not others
like semantics and style.]
2. Can (and should) the models described above be simplified and/or
generalized?
3. Given that we are committed to defining an XML based syntax for
TT, how should we deal with the need to stream TT content? Are there
any precedents in W3C specs for streaming XML content? (e.g., does
XMLP or SOAP address these issues?) I am aware of the BiM format
used with MPEG-7; however, there may be essential IPR claims from
Expway on this format. Should we consider this or attempt to design
something like it?
Well, I hope this will spark a useful discussion, at least one that I
can learn from.
Regards,
Glenn
Received on Wednesday, 29 January 2003 19:55:40 UTC