W3C home > Mailing lists > Public > public-speech-api-contrib@w3.org > May 2012

The Speech API needs support for event listeners for synthesis events

From: Gerardo Capiel <gerardoc@benetech.org>
Date: Thu, 3 May 2012 23:53:35 +0000
To: "<public-speech-api-contrib@w3.org>" <public-speech-api-contrib@w3.org>
Message-ID: <9A572A20-4DC0-46C5-BD75-0BFDFF91F8F7@Benetech.org>
I'm the VP of Engineering at Benetech, the nonprofit behind Bookshare (http://bookshare.org) - the world's largest library of accessible ebooks for people with print disabilities (e.g. blind, dyslexic, cerebral palsy).

Over 70% of our 200K users have learning disabilities, such as dyslexia, and need synchronized highlighting of words as they are being spoken by a TTS engine.  We are planning to integrate the Google Chrome specific TTS APIs into the open source Readium (http://readium.org) EPUB 3 ebook reader to fulfill this use case in a web environment.

To validate market acceptance of this use case, below are examples of vendors to the dyslexic community, which have implemented this synchronized word-level highlighting capability in their applications:

Don Johnston: ReadOut:Loud - http://www.donjohnston.com/products/read_outloud/index.html
Bookshare/Shinano: Read2Go - http://read2go.org/
textHELP: Read&Write Gold - http://www.texthelp.com/North-America/our-products/readwrite
Freedom Scientific: WYNN -  http://www.freedomscientific.com/LSG/products/wynn_features.asp
Levelware: InDAISY - http://levelware.com/

To implement such features in a web application, the TTS engine needs to be able to support JS based synthesized event handlers.  Google implemented a callback mechanism in their Chrome TTS APIs by supporting an event handler as part of the speak() method (http://code.google.com/chrome/extensions/tts.html#events).  The callbacks tied to synthesis events are ideally at the word level or triggered off SSML markers.

You can see some demo's of this capability at the following links:
https://github.com/gcapiel/ChromeWebAppBookshareReader/downloads (install extension in Chrome via .crx download)
https://chrome.google.com/webstore/detail/chhkejkkcghanjclmhhpncachhgejoel (the FLITE voice supports callbacks, so install that first https://chrome.google.com/webstore/detail/edimkjalobeaakbgjdeikeimmacjdppn)

I would highly urge that the Speech API be extended with these capabilities, so that our dyslexic users are not limited to Google Chrome for web based reading and so that the general dyslexic community can benefit from this technology in other web based applications.

Sincerely,

Gerardo

Gerardo Capiel
VP of Engineering, Benetech<http://benetech.org>
650-644-3405
http://twitter.com/gcapiel
Fork, Code, Do Social Good: http://benetech.github.com/
Received on Thursday, 3 May 2012 23:54:14 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 3 May 2012 23:54:15 GMT