- From: Richard Ishida <ishida@w3.org>
- Date: Wed, 14 Jun 2006 16:14:25 +0100
- To: "'Baggia Paolo'" <paolo.baggia@loquendo.com>, <www-voice@w3.org>
- Cc: <public-i18n-core@w3.org>
Hi Paolo, The clarifications here and in Issue R103-26 [1] helped me better understand the intended usage for ASR (SRGS). I had been previousl thinking only of the scenarios you refer to below as "dictation or for unconstrained ASR". I reread section 1.2 with this in mind, and I can see that you are referring to that now, however, I think that, unless a person is more familiar with the SRGS scenario than I was, they will still not appreciate exactly how this is supposed to work. I would recommend expanding section 1.2 with some of the text you include here and the examples in Issue R103-26. Thanks. RI [1] http://lists.w3.org/Archives/Public/www-voice/2006AprJun/0072.html ============ Richard Ishida Internationalization Lead W3C (World Wide Web Consortium) http://www.w3.org/People/Ishida/ http://www.w3.org/International/ http://people.w3.org/rishida/blog/ http://www.flickr.com/photos/ishida/ ________________________________ From: Baggia Paolo [mailto:paolo.baggia@loquendo.com] Sent: 26 May 2006 15:25 To: www-voice@w3.org Cc: Baggia Paolo; Richard Ishida Subject: Re: [PLS1.0] i18n comment: Not easy in ASR Issue R103-33 Proposed Classification: Clarification / Typo / Editorial Resolution: Reject The two main uses of PLS are for SRGS (ASR) and SSML (TTS). In both these cases the PLS are applied on grapheme to define the phonemes to be recognized (for ASR) and to be pronounced (for TTS). There are other uses of PLS, for instance in a dictation or for unconstrained ASR, but which might not be covered by the current specification. Please indicate whether you are satisfied with the VBWG's resolution, whether you think there has been a misunderstanding, or whether you wish to register an objection. Paolo Baggia, editor PLS spec. ------------------------------------------------------------------------ --- From: Date: Tue, 21 Mar 2006 17:52:25 +0000 To: www-voice@w3.org, public-i18n-core@w3.org Message-Id: <20060321175224.030BA4EFDE@homer.w3.org> Comment from the i18n review of: http://www.w3.org/TR/2006/WD-pronunciation-lexicon-20060131/ Comment 33 At http://www.w3.org/International/reviews/0603-pls10/ Editorial/substantive: E Owner: RI Location in reviewed document: 5.4, 2nd para Comment: "Pronunciations are explicitly bound to one or more orthographies within a element so homophones are easy to handle. See the following examples:" This should say, "homophones are easy to handle for text-to-speech". They are not easy to handle in an ASR context, and there should be an informative note here like in 5.5, but referring to ASR rather than TTS! Gruppo Telecom Italia - Direzione e coordinamento di Telecom Italia S.p.A. ================================================ CONFIDENTIALITY NOTICE This message and its attachments are addressed solely to the persons above and may contain confidential information. If you have received the message in error, be informed that any use of the content hereof is prohibited. Please return it immediately to the sender and delete the message. Should you have any questions, please send an e_mail to <mailto:webmaster@telecomitalia.it>webmaster@telecomitalia.it. Thank you <http://www.loquendo.com>www.loquendo.com ================================================
Received on Wednesday, 14 June 2006 15:14:46 UTC