W3C home > Mailing lists > Public > www-voice@w3.org > January to March 2003

Re: Critical missing feature in SSML specification

From: David Poehlman <poehlman1@comcast.net>
Date: Wed, 29 Jan 2003 13:47:11 -0500
To: "Shires, Glen" <glen.shires@intel.com>, www-voice@w3.org
Cc: w3c-wai-pf@w3.org
Message-id: <008501c2c7c6$dcf9b6c0$6501a8c0@handsontech>

I view ssl mark up in the same way that I view html or xml mark up.  The
user agent retrieves it and from there it is under user agent controll.  The
voice browser in this instance would have to have the capability of
manipulating the mark up in the same way s other agents manipulate html or
xml.  While I understand a requirement for a full stop, it must be in post
get since it could most likely be of no benefit in pre-get or in the data
set.  In the case of streaming, it is still a function of another layer
which exercises controll.  I would encourage that this idea be kept but
enforced in a context where it can have effect.

----- Original Message -----
From: "Shires, Glen" <glen.shires@intel.com>
To: <www-voice@w3.org>
Cc: <w3c-wai-pf@w3.org>
Sent: Wednesday, January 29, 2003 1:25 PM
Subject: RE: Critical missing feature in SSML specification

I understand why the scenario you describe requires a "stop" command. I do
not understand how a <STOP> markup tag would fulfill these requirements. It
seems to me that the SSML markup would be already generated and in process
of being spoken by the TTS engine when an event that initiates the "stop"
command occurs. I can envision how a scripted object might accomplish this,
but not how a <STOP> markup tag would do so.

Perhaps you could explain.

Glen Shires
Intel Corporation

-----Original Message-----
From: Richard Schwerdtfeger [mailto:schwer@us.ibm.com]
Sent: Wednesday, January 29, 2003 9:37 AM
To: www-voice@w3.org
Cc: w3c-wai-pf@w3.org
Subject: Critical missing feature in SSML specification
Importance: High

In reviewing the SSML specification we (PF Group) overlooked an extremely
critical missing feature in the last call draft.

It is absolutely essential that SSML support a <STOP> command.


Screen reader users will often hit the stop command to tell the speech
synthesizer to stop speaking. Screen Readers would use the <MARK>
annotation as a way to have the speech engine tell the screen reader when
speech has been processed (marker processed). In the event that the user
tells the screen reader to stop speaking the screen reader should be able
to send a stop command to the speech engine which would utltimately flush
the speech buffers. Markers not returned would help the screen reader know
where the user left off in the user interface (maintain point of regard
relative to what has been spoken).

I apologize for not submitting this in our last call review but this is a
hard requirement. Otherwise, we SSML cannot support screen readers.


Rich Schwerdtfeger
STSM, Software Group Accessibility Strategist
Emerging Internet Technologies
Chair, IBM Accessibility Architecture Review  Board
schwer@us.ibm.com, Phone: 512-838-4593,T/L: 678-4593

"Two roads diverged in a wood, and I -
I took the one less traveled by, and that has made all the difference.",
Received on Wednesday, 29 January 2003 13:48:18 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:07:36 UTC