W3C home > Mailing lists > Public > public-xg-htmlspeech@w3.org > August 2011

RE: Protocol call tomorrow

From: Young, Milan <Milan.Young@nuance.com>
Date: Wed, 17 Aug 2011 12:38:37 -0700
Message-ID: <1AA381D92997964F898DF2A3AA4FF9AD0C744C81@SUN-EXCH01.nuance.com>
To: Robert Brown <Robert.Brown@microsoft.com>, "Michael Johnston (AT&T)" <johnston@research.att.com>, "Patrick Ehlen (AT&T)" <pehlen@attinteractive.com>, "Dan Burnett (Voxeo)" <dburnett@voxeo.com>, "Marc Schroeder (DFKI)" <marc.schroeder@dfki.de>
CC: HTML Speech XG <public-xg-htmlspeech@w3.org>
Task #6 - Parameters


My recommendation is to use MRCP v2 "stickiness" semantics for all of
those parameters defined within that spec.  In general, this means that
writeable parameters set with a SET-PARMS are good until a subsequent
SET-PARAMs overwrites the value or the session ends.  Authors are free
to supply an "override" value for a particular method, but that last
value is implicitly restored when that method transaction completes.


The parameters which are specific to the HTML Speech protocol should
follow the same paradigm.  We may decide, however, that some of the
parameters like "audio-codec" may not be used for a method override.  A
few additional comments:

*        I don't understand why we would prevent
"active/inactive-grammars" from being included in SET-PARAMS.   Not so
much that there is a strong use case for doing this, but singling this
out breaks consistency.

*        Should note that the "Stream-ID" synthesis parameter is

*        MRCP v2 uses case-insensitive parameter names.  Suggest we
follow suite with HTML Speech parameters.


Regarding standardizing URI parameters, I think we agreed to push that
out to the next major version of the protocol spec.



Task #8 - Inactive Grammars


Let's start by defining an inactive grammar.  In my thinking, this is
some grammar which was at one time loaded and parsed by the service, but
is not presently active in a request.  Such grammar made their way into
the service in one of two ways:

  DEFINE-GRAMMAR  - Handles to these grammars SHOULD be held by the
client.  If the client somehow looses track of this list, it can always
re-issue the DEFINE-GRAMMAR request, and this MUST not result in a
service error as long as the mapping is consistent with the original
request.  Once in place, the handle definition MUST be honored by the
service for the duration of the session.  If the service runs low on
resources, it is free to unload the handle's payload, but it must always
continue to honor the handle as though the payload were still available
(performance not withstanding).

  LISTEN/SET-GRAMMR - The now inactive grammar was once an active
grammar in a request.  Unlike explicitly DEFINEd grammars, such grammars
do not have handles that persist past the scope of the parent request.


Assuming we agree on the above, the use case for reporting inactive
grammars is slim.  I find it unlikely that a client would ever need to
read this value as it would probably already have that information in
memory.  For that matter, reporting the active grammar list is probably
unnecessary for the same reason, but at least the active-grammar list is
useful for logging and so should remain.




Other Thoughts

*        I wasn't present when it was decided to rename RECOGNIZE to
LISTEN.  I'm not sure if I support this move, but either way it probably
makes sense that the completion state (currently RECOGNITION-COMPLETE)
would use the same naming convention.

*        Vendor-Specific-Parameters was another casualty in my absence.
This seems to be a pretty easy parameter to spec so I don't understand
why it was pulled.  Perhaps we can discuss this on the next call.







From: Robert Brown [mailto:Robert.Brown@microsoft.com] 
Sent: Wednesday, August 10, 2011 2:29 PM
To: Michael Johnston (AT&T); Patrick Ehlen (AT&T); Dan Burnett (Voxeo);
Marc Schroeder (DFKI); Young, Milan
Cc: HTML Speech XG
Subject: Protocol call tomorrow


There won't be a protocol call tomorrow. I figure SpeechTek will have
some people distracted, and we still have work-items assignments from
last week to get through before we bite off anything else.


Rather than have the call, please refer to the list below, and reply
with ETA for your work items if you have any. (The ETA for my stuff is
Tuesday 16th.)






-- Michael:


1.       TODO: Add a sentence or two about the higher level motivation.
[Michael Johnston]

4.       TODO: Write the rationale for why we mix media and signal in
the same session. [Michael Johnston]

10.   TODO: no match is returned, is the EMMA no-match document
required? [Yes. Michael will provide an example of uninterpreted]

11.   TODO: Insert some EMMA document examples.

[Michael - a 1-best example, an n-best example, and a lattice example]


-- Milan:


6.       TODO: Specify which headers are sticky. URI request parameters
aren't standardized. [Milan]


8.       TODO: Should GET-GRAMMARS also return the list of inactive
grammars/rules? It's not clear how that would be useful. Also, the list
of inactive rules could be rather long and unwieldy [Milan will review


-- Robert:


2.       TODO: add a section on security. Include authentication,
encryption, transitive authorization to fetch resources. [Robert Brown]

7.       TODO: Specify Completion-Cause value for no input stream.

[Robert - pick something away from the existing values]



Received on Wednesday, 17 August 2011 19:39:10 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:16:50 UTC