[emma] Loquendo Implementation Report for EMMA 1.0 Specification

Loquendo Speech Technologies

Executive Summary
 
Loquendo is a strong believer in the considerable advantages that speech
and multimodal standards can bring to speech markets, and continues to
actively support their development and deployment. As a participating
member of the W3C Multimodal Interaction working group, Loquendo
welcomes the Extensible MultiModal Annotation (EMMA) 1.0 Candidate
Recommendation.

EMMA 1.0 allows to create rich annotations for inputs of different
modalities within a Multimodal Application. For instance, EMMA 1.0 is
used as an annotation format for speech and DTMF input within Media
Resource Control Protocol version 2 (MRCPv2). However, EMMA can also be
used by gesture or pen modalities, and it offers interesting features to
represent complex semantic information within an Interaction Manager.

Loquendo is very pleased to be able to contribute by submitting an EMMA
1.0 Implementation Report which covers the relevant features for an EMMA
producer of voice and DTMF results. EMMA 1.0 results are already
available for the Loquendo MRCP Server (a.k.a. Loquendo Speech Suite) to
promote its quick adoption for the benefit of the speech market,
especially for the integration of advanced speech technologies by means
of MRCPv2 protocol in present and future platforms, both in speech /
DTMF contexts and, more in general, in Multimodal application contexts.

Loquendo is continuing to give its complete and wholehearted support to
the work of the W3C Multimodal Interaction and Voice Browser working
groups, as well as to the IETF and the Voice XML Forum, as part of its
continuing commitment and participation in the evolution of this and
other standards. 

Technical Details

Loquendo's EMMA results are focused on the production of EMMA markup
in a speech and DTMF context. A few points that should be noted are:

-	TA #301 : N-best interpretations within emma:one-of element
      are ordered best-first in document order on an acoustic
      score criteria, even if emma:confidence is present.
-	TA #606 : Unable to create an epsilon transition emma:arc 
      without content. Should this be optional?
-	TA #1501 : There is no evidence in EMMA 1.0 CR of this
statement.
      Loquendo asks to remove this Test Assertion from EMMA IR.
-	TA #2100 and #2101 : Note that it is very hard to have absolute
      times in a client- server ASR implementation.
-	TA #2500 : The lattice included emma:cost annotations, but the
      current value was fixed. In future it will contain the actual
cost.
-	Some annotations are under consideration for future
implementations,
      e.g. emma:model, emma:grammar, emma:info, emma:process,
emma:signal,
      emma:signal-size, emma:media-type, emma:source, emma:start,
emma:end,
      emma:dialog-turn.

Paolo Baggia, Loquendo


Gruppo Telecom Italia - Direzione e coordinamento di Telecom Italia S.p.A.

================================================
CONFIDENTIALITY NOTICE
This message and its attachments are addressed solely to the persons above and may contain confidential information. If you have received the message in error, be informed that any use of the content hereof is prohibited. Please return it immediately to the sender and delete the message. Should you have any questions, please send an e_mail to <mailto:webmaster@telecomitalia.it>webmaster@telecomitalia.it. Thank you<http://www.loquendo.com>www.loquendo.com
================================================

Received on Wednesday, 9 April 2008 14:51:53 UTC