Minutes of TT WG Teleconference on 03/18/04 from Thierry MICHEL on 2004-04-26 (public-tt@w3.org from April 2004)

From: Thierry MICHEL <tmichel@w3.org>
Date: Mon, 26 Apr 2004 15:44:03 +0200
To: <public-tt@w3.org>
Message-ID: <000101c42b94$9aacb320$ad02010a@wistiti>
Minutes of TT WG Teleconference on 03/18/04

Attendees

  Glenn Adams (XFSI, Chair, Scribe) [GA]
  Sean Hayes, Microsoft (SH)
  David Kirby (BBC) [DK]

Regrets

  Mike Dolan (invited expert) [MD]
  Geoff Freed (WGBH/NCAM, scribe) [GF]
  Erik Hodge (RealNetworks) [EH]

Absent

  Thierry Michel (W3C) [TM]
  Dave Singer (Apple) [DS]

************************************************************************
Draft Agenda
************************************************************************

[GA] Apologies for not having opportunity to post draft agenda, due
to current travel.

************************************************************************
1. F2F Planning (Amsterdam)
************************************************************************

[GA] Recommends obtaining reservations quickly due to flower season.

[DK] CWI may have difficult public transport due to intermittent bus.

************************************************************************
2. Review Descriptive Vocabulary [DK]
************************************************************************

[DK] Reviews descriptive vocabulary items, with notes and examples
below:

Action ------

The text describes the action that is taking place. It is not uttered.

Example:

<p id="p1" agent="paul"> <span role="action">Pointing to nearby
rocks</span> Erratic boulders like this were found all over northern
Europe and America.  </p>

Caption -------

The text appears as a part of the video content; it is "open" or
"burnt-in" text that forms part of the picture content and as such
its display cannot be switched off by the user. 

Examples: i) a place or time to set the action for the viewer, e.g.
London, 1902

<span role="caption">London, 1902</span>

ii) a subtitle to translate brief foreign language speech, e.g. a
programme produced in English may have Japanese contributions within
it, each with English language subtitles added into the picture. 

<span role="caption">... </span> <!-- needs suitable caption text
adding here -->

Kinesic -------

Do we need this in timed text?

From TEI P4 CH11: any communicative phenomenon, not necessarily
vocalized, for example a gesture, frown, etc.

Music ----- The text describes the music content. Possible details
include composer, album title, track title, performer and recording
reference number, e.g. CD number. [Should we allow this to be
free-format text or provide markup for details such as composer?]

Example:

<span role="music" dur="2.48">Mozart: The Magic Flute</span>

Pause -----

A pause between any elements of text. Where this is associated with
an utterance, it is also attributed to the speaker of that utterance.
The duration of the pause can be specified using either the
additional attribute dur= with a time in seconds or descriptively
with the attribute length= taking one of the values short, medium or
long.

Example: <p id="p1" agent="angela"> What do you think of that!  </p>
<pause dur="3.5"/> <p id="p2" agent="laura"> It's...<pause
length="medium"> unbelievable.  </p>

Phrase ------ Do we need this as it seems to be a subdivision of
utterance?

Shift -----

Do we need this? We already have R305 - Style Parameters - Aural
(which includes, for example, pitch)

To quote TEI P4 CH11:

<shift> marks the point at which some paralinguistic feature of a
series of utterances by any one speaker changes.

The who attribute supplies an identifier for the speaker or group of
speakers whose shift in some feature is being noted. Its value is the
identifier of a <participant> or <participant.grp> element in the TEI
header.

feature (feature) a paralinguistic feature.

new (new state) specifies the new state of the paralinguistic feature
specified.

Many paralinguistic features (pitch, prominence, loudness, etc.)
characterize stretches of speech which are not co-extensive with
utterances or any of the other units discussed so far. One simple
method of encoding such units is simply to mark their boundaries. An
empty element called <shift> is provided for this purpose.

* <shift> marks the point at which some paralinguistic
  feature of a series of utterances by any one speaker changes.
  feature (feature) a paralinguistic feature.  new (new state)
  specifies the new state of the paralinguistic feature specified.

A <shift> element may appear within an utterance or a segment to mark
a significant change in the particular feature defined by its
attributes, which is then understood to apply to all subsequent
utterances for the same speaker, unless changed by a new shift for
the same feature in the same speaker. Intervening utterances by other
speakers do not normally carry the same feature. For example:

<u who="lb"><shift feature="loud" new="f"/>Elizabeth</u> <u
who="eb">Yes</u> <u who="lb"><shift feature="loud" new="normal"/>Come
and try this <pause/> <shift feature="loud" new="ff"/>come on</u>

Song ----

The text is the lyrics of a song.

Sound -----

A non-verbal utterance, for example, breaking glass, door slam,
explosion.

Example: <p id="p1" agent="john"> Look over there!  </p> <span
role="sound">Car brakes squeal</span> <p id="p2" agent="emma"> Quick!
Let's go!  </p>

Thought -------

An utterance that is presented, not as direct speech from the
speaker, but as the thoughts of that speaker. Presentation may be as
text, e.g. a "thought-bubble" or as a voice-over. 

Title -----

The text is the title of the programme or work.

ADD: this is a sub-category of caption?  similarly, "directed by",
etc.

Problem: this could escalate and we will end up trying to mark up all
the parts of a title sequence and end credits.

Translation -----------

The text is the translation of the associated text content. This is
intended as a means of providing a translation of particular words to
aid reuse rather than a translation of the complete text. [See also
"meaning" below]

Example needed here.

Meaning ------- ?? Should this be an attribute?

For translation purposes, it is frequently required to have a further
explanation of the meaning of slang or similar words, e.g. "Thanks a
bunch". The meaning attribute provides a way to add descriptive text
for this purpose.

Example: <p id="p1" agent="barry"> <span meaning="Used as a term \"Be
careful\"">Watch</span> your step!  </p>

Utterance ---------

A contiguous stretch of speech associated with one speaker. An
utterance always ends on a change of speaker but may also end where
there is a significant pause in the speech.

[Should we make utterance a subdivision of paragraph, otherwise they
seem to have the same meaning?]

Example (but the same as paragraph):

<utterance id="p1" agent="narrator"> An extraordinary battle against
nature began.  </uterance>

Vocalization (non-lexical) -------------------------- ?? - what was
this going to cover? 

Writing -------

The text is not spoken but revealed as written content, shown to the
viewewr in the picture. 

Example: silently revealing text to a contestant in a game-show.

************************************************************************
3. Action Items
************************************************************************

[GA] Asks all to review action items, so we can conduct full review at
next telecon.

************************************************************************
3. Next Meetings
************************************************************************

Normal weekly telecon schedule (Noon Eastern Time)

May 10-12 F2F Amsterdam (CWI); start 10:30AM on 10th, end 1800h on 12th
Jun 22-24 F2F Mt View, CA (Microsoft)

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
START SUMMARY
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

*** RESOLUTIONS ***

*** OPEN ACTION ITEMS ***

Action: [SH] Will investigate use of media queries in this context and
report back.

Action: [DS with help of Paul Nelson and Peter Lofting] Write RFC to
register appropriate opentype/truetype font types as MIME media types,
suggest model of "application/font-<font-type-name>", e.g.,
"application/font-truetype".

Action: [GA] Make proposal regarding use of Xlink vocabulary or
"src" attribute.

Action: [GF] Investigate whether to use IRIs instead of URIs?
Note: XPointer and Namespaces in 1.1 use IRIs?

Action: [SH] Investigate use of "role" vs "class" attribute.

Action: [GF] Investigate mechanism for cascading
semantics and whether to support cascading on either or both
logical and presentation flowed vocabularies.

Action: [GA] Draft new requirement on "Integrability"
in general terms that should not impact testing or implementation
requirements.

Action: [GA] incorporate agreed changes into TT-AF-1-0-REQ in
preparation for publishing final W3C Note.

Action: [SH] will review and propose subset of aural parameters
(see R305).

Action: [GA] Add figure showing logical structure anticipated
by requirements.

Action: [GA] Add note to R217 and R219 that shows use of data: URI
scheme.

Action: [TM] Find out if xpointer() scheme WD [1] is still being
progressed forward.

[1] http://www.w3.org/TR/xptr-xpointer/#document-order-notation

Action: [SH] to propose subset with extensions for use in CS
Profile.

Action: [GA]/[GF] investigate syntax for regions vis-a-vis style.

Action: [DK] To write up short paragraph on uses of
role tokens. Suggest removing or adding as he
progresses.

Action: [TM] to propose and define standard MD attributes.

Action: [*] Think about standard MD items (and write something
down and send to list so we can think about it too).

Action: [GA] Need to start planning for June meeting in Japan!

Action: [GA] Post edited draft minutes for previous F2F (02/04)
for review.

Action: [TM] consult with W3C Legal to determine whether requirements
exist to explicitly obtain disclosure from past and current public
comments that have resulted in requirements, etc., specifically,
requirement R307. Note that this question is in regard to current
TTWG status under CPP.

Action: [TM] review test suites developed by RDF WG, which focused on
authorial intent, to see if some concepts and methodology can be
reused in TTWG context.

Action: [TM] Will research CR exit criteria and make proposal.

Action: [SH] to draft standard ready text for subset of xquery.

Action: [GA] post example file to member reflector.

Action: [DS] Based on email of 01/14, integrate with fillBehavior as
described in example.

Action: [SH] make proposal regarding keeping or removing float style
parameters.

*** OPEN ISSUES ***

Issue: Whether to use XLink vocabulary, e.g., as used consistently by
SVG, or use "src" attribute as apparently will be done in XHTML2?

Issue: Whether to use IRIs instead of URIs? Note: XPointer and
Namespaces in 1.1 use IRIs?

Issue: Should we use "class" instead of "role"?

Issue: Probably want to permit in logical content mode the selection of
content based on generic XML features of non-TT namespace descriptive
markup, e.g., for applying style and timing semantics, in which case an
appropriate TT container element shall be implied based on nearest
ancestor TT namespace element.

Issue: Need to think about cascading semantics; how to express, how to
apply, etc. Possibly use CSS semantics here as well.

Issue (2004-03-05): Whether to allow block as immediate child of
inline? N.B. XSL-FO does allow this. [GA] showed example of renmoji
(horizontal block in vertical Japanese lines).

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
END SUMMARY
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Thierry MICHEL
W3C/ERCIM
Received on Monday, 26 April 2004 09:46:57 UTC