[wiki] Multimodal User Input

Collaborative Software Community Group,

On the topics of multimodality and document services, a Document Services API could process multimodal user input: audio input, document object model elements produced by speech-to-XML components (e.g. SSML), or data from speech recognition components (https://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi.html, https://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi.html#speechreco-resultlist).
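
As one sketch of how such recognition data could flow into a document service, a SpeechRecognitionResultList might be mapped onto SSML sentence elements for downstream processing. The interfaces below are transcribed from the draft specification linked above; the mapping function itself is hypothetical:

    // Minimal typings for the Web Speech API result interfaces,
    // transcribed from the draft at
    // https://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi.html
    interface SpeechRecognitionAlternative {
      readonly transcript: string;
      readonly confidence: number;
    }

    interface SpeechRecognitionResult {
      readonly length: number;
      readonly isFinal: boolean;
      item(index: number): SpeechRecognitionAlternative;
    }

    interface SpeechRecognitionResultList {
      readonly length: number;
      item(index: number): SpeechRecognitionResult;
    }

    // Hypothetical mapping: convert final recognition results into SSML
    // sentence (<s>) elements that a document service could consume.
    // (A real implementation would escape XML special characters in the
    // transcript text.)
    function resultListToSsml(results: SpeechRecognitionResultList): string {
      const sentences: string[] = [];
      for (let i = 0; i < results.length; i++) {
        const result = results.item(i);
        if (result.isFinal && result.length > 0) {
          // Take the top-ranked alternative for each final result.
          sentences.push("<s>" + result.item(0).transcript + "</s>");
        }
      }
      return '<speak version="1.1" xmlns="http://www.w3.org/2001/10/synthesis">'
        + sentences.join("") + "</speak>";
    }
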
Document services could also refine speech recognition outputs (http://en.wikipedia.org/wiki/Outline_of_natural_language_processing#Component_processes_of_natural_language_understanding) and provide feedback on multimodal data, e.g. the rate, projection, movement, vocal variety, and prosody of spoken language. Use cases include enhanced speech recognition and facilitating public speaking exercises (see also: http://www.mooc-list.com/tags/public-speaking).
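
As a minimal sketch of one such feedback metric, assuming the hosting application supplies a final transcript and a measured utterance duration (the threshold constants are illustrative assumptions, not taken from any specification), speaking rate in words per minute could be estimated as follows:

    // Illustrative bounds for public speaking exercises; assumed values,
    // not normative.
    const MIN_TARGET_WPM = 120;
    const MAX_TARGET_WPM = 160;

    // Hypothetical feedback function: estimate speaking rate from a
    // final transcript and the measured duration of the utterance.
    function speakingRateFeedback(transcript: string,
                                  durationSeconds: number): string {
      const words = transcript.trim().split(/\s+/).filter(w => w.length > 0);
      if (durationSeconds <= 0 || words.length === 0) {
        return "Not enough data to estimate speaking rate.";
      }
      const wpm = (words.length / durationSeconds) * 60;
      if (wpm < MIN_TARGET_WPM) {
        return "Pace: " + wpm.toFixed(0) + " wpm; consider speaking a little faster.";
      }
      if (wpm > MAX_TARGET_WPM) {
        return "Pace: " + wpm.toFixed(0) + " wpm; consider slowing down.";
      }
      return "Pace: " + wpm.toFixed(0) + " wpm; within the target range.";
    }

Per-word timing is not exposed by the recognition interfaces above, so the utterance duration would have to come from the application's own audio capture.
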
Kind regards,

Adam Sobieski

Received on Saturday, 12 April 2014 19:19:11 UTC