- From: Gregory J. Rosmaita <oedipus@hicom.net>
- Date: Wed, 11 Jul 2007 15:43:02 -0400
- To: wai-xtech@w3.org
what follows is an excerpt from my reply to ben when i emailed him about the status of the plans he had outlined, and which can be found at: http://lists.w3.org/Archives/Public/wai-xtech/2007May/0048.html this was just a quick knee-jerk reaction to some of the problems ben is experiencing as an implementor so as to accessify and increase the usability of the reCAPTCHA interface. for those of you joining this thread late, you can test the reCAPTCHA yourself at: http://www.recaptcha.net/ basically what follows is some very basic low-level advice, but since i placed an excerpt from ben's personal reply to the wai-xtech list http://lists.w3.org/Archives/Public/wai-xtech/2007Jul/0009.html it is only fitting and proper to also archive the other side of the conversation, in an attempt to avoid redundancy at such a meta-level; i don't believe there to be anything inflamatory about the information i provided him, which is a polite way of saying, if you know of a screen reader developer (windows, linux, and mac) please post that info to this thread, rather than argue over market-share for products x y and z ---- Begin Excerpt of GJR's Reply to Ben M ------- as for screen-readers, you can either try the free Non-Visual Desktop Access (NVDA) screen reader for windows: http://en.wikipedia.org/wiki/NonVisual_Desktop_Access (description) http://www.nvda-project.org/ (the NVDA project's site) there is also charles chen's FireVox self-voicing extention for FireFox: http://clc4tts.clcworld.net/clc-firevox_doc.html which is also free... also, you might try contacting the major screen reader and screen magnification software developers -- i'm sure that there would be buy in for the screen-reader or screen readers that helped make reCAPTCHA accessible... here are the 3 major players in the magnification and/or screen-reader business: http://www.freedomsci.com/ (JAWS and MAGIC) http://www.gw-micro.com/ (Window-Eyes) http://www.yourdolphin.com/ (HAL and SuperNova) of particular note is that HAL offers the user a choice of keyboard layouts -- one can overlay the JAWS keyboard or the Window-Eyes keyboard (or use the native HAL keyboard) which makes the learning curve a little less steep, as it is easier to switch to something an individual user is familiar with -- the keyboard overlay for another screen reader's commands, but with the functionality of HAL or SuperNova (more bang for your buck if you have to purchase) i like the 7 out of 8 rule, but worry that those with cognative problems may be flummoxed by the backwards looping; i think a switch from the google-like implementation to alternating voices of different characteristics (male to female to deep male voice to childlike voice, etc.) would be sufficient to trick voice-recognition software, such as in the (real life) case of a blind user who also happens to be a double-amputee, and has to rely on speech-input as well as speech output: if the user is male, he needs to train the voice-recognition software to correctly interpret his vocal commands (while wearing an earpiece, so that the software cannot hear his screen-reader's output) AND then switch his screen reader voice and/or voices to the opposite sex, so that when both voice-input and voice-output are used together, it responds only to his voice, and NOT that which issues from a speech synthesizer, so that the voice-recognition software does not misinterpret synthesized speech, as issuing commands or entering text into an editable document; obviously, if the user is female (real life case: paralysis of one arm, and an arthritis in the other arm which is so severe that her hand is locked into a clenched position) you simply do the opposite of what i just outlined above -- use male voices for speech output, and train the voice-recognition software to recognize your (female) voice... so, if each alpha-numeric character was spoken in alternating voices, such as "average male" to "average female" to "male child" to "female child", and so on, i think you could foil attempts at speech recognition not by adding noise and backwards loops, but simply by changing voices within an audio captcha thanks again for all of your time and patience (we are trying to get someone to take responsibility for firing aural clues using ACSS when triggered by user interaction or by their assisstive technology (especially curtis, as FireVox supports the CSS3-speech module) -- one path i'm ivestigating is to integrate XML Events into HTMLx (the work of the HTML WG) and into XHTML2 precisely for the purpose of firing purely aural cues (which help a lot more people than the blind, when actually implemented) -- the problem is, no one wants to allow anyone access to their OS' native embedded audio renderer -- the beneath the hood engine that fires system sounds and plays embedded files without opening a media player instance, so the long-term approach is to define an XML Event that is interoperable and standard across platforms, so that there is more buy-in by developers... but for now, let's deal with today's technological realities, with an eye to the future development path that the HTML WG ends up pursuing in the area of forms -- and, hopefully, that path will lead to integration with XForms... type at you soon, gregory. -------------------------------------------------------------- DISCUSSION, n. A method of confirming others in their errors. -- Ambrose Bierce, The Devil's Dictionary -------------------------------------------------------------- Gregory J. Rosmaita, oedipus@hicom.net Camera Obscura: http://www.hicom.net/~oedipus/ --------------------------------------------------------------
Received on Wednesday, 11 July 2007 19:43:12 UTC