W3C home > Mailing lists > Public > www-voice@w3.org > October to December 2003

Re: VoiceXML vs SALT

From: Dean Sturtevant <deansturtevant@comcast.net>
Date: Fri, 7 Nov 2003 07:42:12 -0500
Message-ID: <002501c3a52c$91cfbac0$b7cf6041@presario>
To: "Daniel Burnett" <burnett@nuance.com>, "Ildar Gabdulline" <ildar@realeastnetworks.com>, <www-voice@w3.org>

One might argue, and I will, that VoiceXML is TOO high level. I am not
acquainted with SALT, but a low-level interface is in fact the appropriate
kind of interface for a standard, in my view. Higher level constructs can be
provided as add-ons, freely distributed. VoiceXML does not provide the
flexibility needed to support voice applications of any complexity. In
addition it needlessly (in my view) presents a new programming paradigm (the
FIA) which is flawed in a few respects. Better would be to provide low-level
constructs that can be used in association with rich and well-defined
control languages such as Java or Python. If SALT supports this notion, then
it should be considered to be superior to VoiceXML (as long as the
underlying functionalities of TTS, ASR, telephony, and audio output are
available from SALT).

- Dean Sturtevant (representing the views of myself only)

----- Original Message ----- 
From: "Daniel Burnett" <burnett@nuance.com>
To: "Ildar Gabdulline" <ildar@realeastnetworks.com>; <www-voice@w3.org>
Sent: Thursday, November 06, 2003 8:43 PM
Subject: RE: VoiceXML vs SALT

Dear Ildar,

SALT and VoiceXML can both be used for multimodal applications, and for
simultaneous as well as sequential multimodality.  Just as SALT tags
must be combined with XHTML to form a multimodal language, so must
VoiceXML be coupled with XHTML.  The XHTML + Voice language (X+V), a
submission to the W3C Multimodal Interaction Working Group, provides a
formal means for doing this, although other approaches to using VoiceXML
in multimodal systems have also been demonstrated.  VoiceXML and SALT
can of course both be used in voice-only applications.

In addition to speech markup features, VoiceXML also includes high-level
markup for controlling the voice dialog.  In SALT, the developer has to
worry about many low-level details such as handling errors, managing
dialog flow, and preventing dialogs from getting into the "hanging state"
(See SALT 1.0 section 2.6.5).  These tasks are all automatically taken
care of in VoiceXML.

SALT tightly combines the visual view and the voice modes through its
heavy reliance on ECMAScript.  In contrast, X+V loosely couples the modes.
This enables distributed architectures that place lower demands on
clients and networks and that leverage the burgeoning ecosystem of
VoiceXML servers and applications.

While VoiceXML and SALT are both markup languages for voice and
multimodal applications, only VoiceXML is on a W3C Recommendation track
today, a significant factor in the huge industry uptake of the language.
It is widely deployed and is commonly accepted as the worldwide standard
for voice applications.  It is in use today in high volume, robust, and
reliable commercial systems.  We would be more than happy to provide more
detailed pointers to such systems upon request.


Technical Council
VoiceXML Forum

-----Original Message-----
From: Ildar Gabdulline [mailto:ildar@realeastnetworks.com]
Sent: Sunday, October 26, 2003 7:20 AM
To: www-voice@w3.org
Subject: VoiceXML vs SALT


I am relatively new to voice dialogs.
Could you please describe me - what are the differences between VoiceXML and
As I understood for the moment both of them are used for the same purposes -
programming of the dialogs.
If this is correct then it seems that having two standard families is

Please clarify the situation, if it is possible.

Received on Friday, 7 November 2003 07:40:29 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:07:37 UTC