Re: [css3-speech] Proposal: an aural box model

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Wed, 28 Jul 2004, csant wrote:

> 
> "The pause is inserted between the element's content and any
> 'cue-before' or 'cue-after' content." [1]
> 
> In my opinion there is a need to think about a new kind of *aural
> box model*, which (as far as I know) has not been defined yet, to
> be able to exactly understand how 'pause' works, how generated
> content is to be added with pseudo-elements, and to realize what
> we are missing in the spec.
> 
> Currently, a 'pause' is defined as "a pause or prosodic boundary
> to be observed before (or after) speaking an element's content". A
> 'cue' is defined as a sound to be "played before and/or after the
> element to delimit it". A 'pause' "is inserted between the
> element's content and any 'cue-before' or 'cue-after' content".
> 
> This describes a model that can be rendered visually in the
> following way:
> 
> cue-before . pause-before . <element> . pause-after . cue-after
> 
> and can be compared to the visual box model, in that the 'cue'
> is the aural equivalent of 'border' and 'pause' is the
> equivalent of 'padding'.
> 
> Defining an aural box also helps to determine exactly where
> generated content would be added by any pseudo-element.
> 
> The issue is that there is no aural equivalent to 'margin', i.e.
> there is no way to determine the interval of time between the
> 'cue-after' of an element and the 'cue-before' of the next
> element.

How important is this in practice, given that the audio files
used for cues can include silent preambles and postambles?
Silence compresses efficiently, so the padding has little
impact on file size.
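
As a rough illustration of what I mean by baking the silence
into the cue file itself, here is a minimal sketch in Python
using the standard wave module. It assumes an uncompressed PCM
WAV cue; the function name pad_cue_with_silence and its
parameters are made up for this example and are not part of any
spec or tool.

```python
import io
import wave


def pad_cue_with_silence(cue_wav: bytes, before_s: float, after_s: float) -> bytes:
    """Return a copy of a PCM WAV cue with silence added before and after it."""
    with wave.open(io.BytesIO(cue_wav), "rb") as src:
        params = src.getparams()
        frames = src.readframes(src.getnframes())

    # One frame holds one sample per channel.
    frame_size = params.sampwidth * params.nchannels
    # 8-bit PCM WAV is unsigned (silence is 0x80); wider samples are
    # signed (silence is 0x00).
    silent_byte = b"\x80" if params.sampwidth == 1 else b"\x00"
    lead = silent_byte * (int(before_s * params.framerate) * frame_size)
    tail = silent_byte * (int(after_s * params.framerate) * frame_size)

    out = io.BytesIO()
    with wave.open(out, "wb") as dst:
        dst.setparams(params)
        # writeframes() fixes up the frame count in the header.
        dst.writeframes(lead + frames + tail)
    return out.getvalue()
```

Calling pad_cue_with_silence(data, 0.25, 0.5) on the bytes of a
cue file would give a new cue with 250ms of silence in front and
500ms behind, which can then be referenced from 'cue-before' or
'cue-after' as usual.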

If there does turn out to be a strong need for cues to specify
periods of silence before and after, we could, I suppose,
introduce properties like cue-silence-before and
cue-silence-after, but it would get very messy if we tried to
fold them into the existing cue-before, cue-after and cue
properties.
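
To make the ordering concrete, here is a small sketch in Python
of how such silence periods might sit around the model quoted
above. Everything in it is an assumption for illustration:
cue-silence-before and cue-silence-after are only the
hypothetical properties floated here, AuralBox, timeline and
gap_between are invented names, and adding adjacent silences
together (rather than collapsing them as visual margins do) is
just one possible choice.

```python
from dataclasses import dataclass


@dataclass
class AuralBox:
    """Durations in seconds; field names mirror the CSS properties."""
    content: float
    cue_before: float = 0.0
    cue_after: float = 0.0
    pause_before: float = 0.0
    pause_after: float = 0.0
    cue_silence_before: float = 0.0  # hypothetical property
    cue_silence_after: float = 0.0   # hypothetical property


def timeline(box: AuralBox) -> list[tuple[str, float]]:
    """Segments in rendering order, with the silences outside the cues."""
    return [
        ("cue-silence-before", box.cue_silence_before),
        ("cue-before", box.cue_before),
        ("pause-before", box.pause_before),
        ("content", box.content),
        ("pause-after", box.pause_after),
        ("cue-after", box.cue_after),
        ("cue-silence-after", box.cue_silence_after),
    ]


def gap_between(first: AuralBox, second: AuralBox) -> float:
    """Silence from first's cue-after to second's cue-before (plain sum)."""
    return first.cue_silence_after + second.cue_silence_before
```

With this sketch, an element with a cue_silence_after of 0.25
followed by one with a cue_silence_before of 0.5 would be
separated by 0.75 seconds between the two cues.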


> 
> Regards,
> /c
> 
> ----
> [1] http://www.w3.org/TR/2004/WD-css3-speech-20040727/#pause-props
> 
> 

- -- 
 Dave Raggett <dsr@w3.org>  W3C lead for voice and multimodal.
 http://www.w3.org/People/Raggett +44 1225 866240 (or 867351)
 
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)

iD8DBQFBCkfPb3AdEmxAsUsRAgQgAJ9GYjITNvB8jVBnq6p+EJEAnwfdswCdExW/
38KQujS2cJ8OlDCUzThnMpc=
=r3vZ
-----END PGP SIGNATURE-----

Received on Friday, 30 July 2004 09:05:37 UTC