- From: Scott McGlashan <scott.mcglashan@pipebeach.com>
- Date: Sun, 17 Feb 2002 00:34:11 +0100
- To: "George Clelland" <george_clelland@uk.ibm.com>
- Cc: <www-voice@w3.org>
Hi George,
sorry for the delay in getting you this offical responses.
Below is our reponse to your questions (indicated by >>>).
If you are not satisfied with the reply, or want more information, let
me know.
Scott
(Chairman, W3C VB Dialog Committee)
-----Original Message-----
From: George Clelland [mailto:george_clelland@uk.ibm.com]
Sent: 23 November 2001 18:16
To: www-voice@w3.org
Subject: VoiceXML 2.0 comments
I have some personal comments on the VoiceXML 2.0 Working Draft.
An additional form of transfer would be useful. A standard
supervised
transfer would be useful, where the interpretor disconnects from
the
telephone call after a successful connection has been established
with
the third party. This would be beneficial for call centre
applications.
>>> We have received a number of request for additonal forms of
transfer. In VoiceXML 2.0, our primary goal has been to clarify the
VoiceXML 1.0 language, rather than add new features in this version
(especially since there is a separate activity in the working group
looking at more advanced Call Control functionality). For this reason,
we defer this feature request for consideration in a later version of
the language.
The text associated with the code example give in section 3.1.5 has
an
error. The text states the result will be "AMEX if the caller
enters
DTMF 1; the text should say DTMF 3.
>>> This will be corrected in the next draft.
Many of speech markup elements are too complex for application
programmers, and should only be used by specialists. Given that
the
philosphy of voiceXML is to make speech applications easier to
develop
and minimise the specialist speech knowledge required, elements
such
as <prosody> and <phoneme> seem to contradict this goal.
>>> Application developers do not need to use SSML. However, as they
become more experienced, and applications become more demanding, the
advanced features of SSML may become useful.
The first code example in section 4.1.3 has an error. The <say-as>
element has an old parameter, namely class rather than type.
>>> This will be corrected in the next draft.
The discussion on prompt queuing and input collection in section
4.1.8
clarifies the operation of interpretors. However, I believe the
current operation is flawed in that it forces the use of fetchaudio
in
order to have a prompt at the end of one document played before the
next document is fetched. A typical call flow will have a prompt
telling the caller to wait while some data is retrieved, but with
the
current operation, this prompt is not played to the caller until
after
the second document is fetched, unless fetechaudio is used. This is
not an intuitive operation, as the addition of the optional
fetchaudio changes the way the application works and leads to
unexpected results. I would propose that all queued prompts are
played
as soon as a document is to be fetched and then the fetchaudio if
specified, this is much more obvious form of operation for
application
developers.
>>> We have seem many applications where it is highly desirable to queue
prompts and then play them when the next document is loaded. To obtain
the behavior you want, there are a number of options. 1) Use a
fetchaudio to read out the message out, e.g. "Now fetching your data",
2) put the message in a queued prompt and add a 'dummy' fetchaudio (eg.
500msecs of slience) to force the queue to flush prior to fetching, or
3) Put the message in a preceding field with a zero 'timeout': this
forces the queue to flush prior to document flush.
I would like to propose an additional fetch property (section
6.3.5)
.... fetchaudiorepeatdelay ==> defined as the delay between
successive
plays of the fetchaudio.
We have considered this feature for VoiceXML 2.0 but have decided to
defer it for future versions of the language where we will provide a
more powerful model for generating output (and collecting input) during
document transition.
George Clelland
EMEA Voice Systems
DirectTalk & Message Center pre-sales Technical Support
IBM UK Laboratories
Hursley Park, Mail Point 104
Winchester
Hants, UK SO21 2JN
email: george_clelland@uk.ibm.com Tel: +44 (0)1962 816657 Fax:
+44
(0)1962 816800
Received on Saturday, 16 February 2002 18:31:57 UTC