RE: Speech API: first editor's draft posted from Jim Barnett on 2012-04-13 (public-speech-api@w3.org from April 2012)

From: Jim Barnett <Jim.Barnett@genesyslab.com>
Date: Fri, 13 Apr 2012 06:58:07 -0700
To: "Hans Wennborg" <hwennborg@google.com>, <public-speech-api@w3.org>
Cc: "Satish S" <satish@google.com>, "Glen Shires" <gshires@google.com>
Message-ID: <E17CAD772E76C742B645BD4DC602CD8106085E65@NAHALD.us.int.genesyslab.com>

A couple of quick comments:
1) The current TTS API sets the string to be played as a single action.
Could it be useful to add an 'append' function, allowing the programmer
to append text to the string that is already playing?  I'm thinking of
the case where the page wants to play out a large document, possibly one
that's being streamed from another source.  We'd have to handle the edge
cases, of course.  For example, I think that doing an append after the
play had stopped (when 'ended'==true) would simply set the text field
and a new start() would be required before play would resume.

2) In the ASR example, should 
           q.value = event.result[0].transcript;
be
      q.value = event.item[0].transcript;

I am assuming that 'event' is of type SpeechRecognitionResult.  In any
case, it would be useful if the text was explicit about the type of
object returned in the continuous==true and continuous==false cases.

- Jim

-----Original Message-----
From: Hans Wennborg [mailto:hwennborg@google.com] 
Sent: Thursday, April 12, 2012 10:36 AM
To: public-speech-api@w3.org
Cc: Satish S; Glen Shires
Subject: Speech API: first editor's draft posted

In December, Google proposed [1] to public-webapps a Speech JavaScript
API that subset supports the majority of the use-cases in the Speech
Incubator Group's Final Report. This proposal provides a programmatic
API that enables web-pages to synthesize speech output and to use speech
recognition as an input for forms, continuous dictation and control.

We have now posted in the Speech-API Community Group's repository, a
slightly updated proposal [2], the differences include:

 - Document is now self-contained, rather than having multiple
references to the XG Final Report.
 - Renamed SpeechReco interface to SpeechRecognition
 - Renamed interfaces and attributes beginning SpeechInput* to
SpeechRecognition*
 - Moved EventTarget to constructor of SpeechRecognition
 - Clarified that grammars and lang are attributes of SpeechRecognition
 - Clarified that if index is greater than or equal to length, returns
null

We welcome discussion and feedback on this editor's draft. Please send
your comments to the public-speech-api@w3.org mailing list.

Glen Shires
Hans Wennborg

[1]
http://lists.w3.org/Archives/Public/public-webapps/2011OctDec/1696.html
[2] http://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi.html

Received on Friday, 13 April 2012 13:58:50 UTC