- From: Daniel Weck via cvs-syncmail <cvsmail@w3.org>
- Date: Wed, 06 Jul 2011 19:47:26 +0000
- To: public-css-commits@w3.org
Update of /sources/public/csswg/css3-speech In directory hutz:/tmp/cvs-serv7174 Modified Files: Overview.html Overview.src.html Log Message: attempt to clarify voice selection Index: Overview.html =================================================================== RCS file: /sources/public/csswg/css3-speech/Overview.html,v retrieving revision 1.61 retrieving revision 1.62 diff -u -d -r1.61 -r1.62 --- Overview.html 6 Jul 2011 19:03:45 -0000 1.61 +++ Overview.html 6 Jul 2011 19:47:24 -0000 1.62 @@ -1753,11 +1753,13 @@ <p>The ‘<a href="#voice-family"><code class=property>voice-family</code></a>’ property specifies a - comma-separated, prioritized list of values that designate speech - synthesis voices (analogous to ‘<code class=css><a - href="#font-family-def"><code + comma-separated, prioritized list of component values that are separated + by a comma to indicate that they are alternatives (this is analogous to + ‘<code class=css><a href="#font-family-def"><code class=property>font-family</code></a></code>’ in visual style - sheets), where: + sheets). Each component value potentially designates a speech synthesis + voice instance, by specifying match criteria (see the <a + href="#voice-selection">voice selection</a> section on this topic). <p> <strong><generic-voice></strong> = [<age>? <gender> <integer>?] @@ -1867,27 +1869,37 @@ <p>The ‘<a href="#voice-family"><code class=property>voice-family</code></a>’ property is used to guide - the selection of the speech synthesis voice. As part of this selection - process, speech-capable user agents must also take into account the - language of the selected element within the markup content. The "name", - "gender", "age", and preferred "index" are voice selection hints that get - carried down the content hierarchy as the ‘<a - href="#voice-family"><code class=property>voice-family</code></a>’ - property value gets inherited by descendant elements. At any point within - the content structure, the language takes precedence (i.e. has a higher - priority) over the specified CSS voice characteristics. The following list - outlines the selection algorithm (note that the definition of "language" - is loose here, in order to cater for dialectic variants): + the selection of the speech synthesis voice instance. As part of this + selection process, speech-capable user agents must also take into account + the language of the selected element within the markup content. The + "name", "gender", "age", and preferred "variant" (index) are voice + selection hints that get carried down the content hierarchy as the + ‘<a href="#voice-family"><code + class=property>voice-family</code></a>’ property value gets + inherited by descendant elements. At any point within the content + structure, the language takes precedence (i.e. has a higher priority) over + the specified CSS voice characteristics. + + <p> The following list outlines the voice selection algorithm (note that + the definition of "language" is loose here, in order to cater for + dialectic variants): <ol> - <li> If only a single voice is available for the language of the selected - content, then this voice must be used, regardless of the specified CSS - voice characteristics. + <li> If only a single voice instance is available for the language of the + selected content, then this voice must be used, regardless of the + specified CSS voice characteristics. - <li> If several voices are available for the language of the selected - content, then the chosen voice is the one that most closely matches the - specified gender, age, and preferred voice variant. The actual definition - of "best match" is processor-dependent. + <li> If several voice instances are available for the language of the + selected content, then the chosen voice is the one that most closely + matches the specified name, or gender, age, and preferred voice variant. + The actual definition of "best match" is processor-dependent (e.g. a + reasonable match for "voice-family: 10 male;" may well be a higher + pitched female voice suitable for a young boy's vocal rendition). If no + voice instance matches the characteristics provided by any of the + ‘<a href="#voice-family"><code + class=property>voice-family</code></a>’ component values, the first + available voice instance (amongst those suitable for the language of the + selected content) must be used. <li> If no voice is available for the language of the selected content, it is recommended that user-agents let the user know about the lack of Index: Overview.src.html =================================================================== RCS file: /sources/public/csswg/css3-speech/Overview.src.html,v retrieving revision 1.62 retrieving revision 1.63 diff -u -d -r1.62 -r1.63 --- Overview.src.html 6 Jul 2011 19:03:45 -0000 1.62 +++ Overview.src.html 6 Jul 2011 19:47:24 -0000 1.63 @@ -1383,9 +1383,12 @@ </tr> </tbody> </table> - <p>The 'voice-family' property specifies a comma-separated, prioritized list of values that - designate speech synthesis voices (analogous to '<a href="#font-family-def"><code - class="property">font-family</code></a>' in visual style sheets), where: </p> + <p>The 'voice-family' property specifies a comma-separated, prioritized list of component values + that are separated by a comma to indicate that they are alternatives (this is analogous to '<a + href="#font-family-def"><code class="property">font-family</code></a>' in visual style + sheets). Each component value potentially designates a speech synthesis voice instance, by + specifying match criteria (see the <a href="#voice-selection">voice selection</a> section on + this topic). </p> <p> <strong><generic-voice></strong> = [<age>? <gender> <integer>?] </p> <p class="note"> Note that the functionality provided by this property is related to the <a @@ -1462,21 +1465,26 @@ voice-family: "john doe", "Henry the-8th";</pre> </div> <h4 class="no-toc" id="voice-selection">Voice selection, content language</h4> - <p>The 'voice-family' property is used to guide the selection of the speech synthesis voice. As - part of this selection process, speech-capable user agents must also take into account the - language of the selected element within the markup content. The "name", "gender", "age", and - preferred "index" are voice selection hints that get carried down the content hierarchy as the - 'voice-family' property value gets inherited by descendant elements. At any point within the - content structure, the language takes precedence (i.e. has a higher priority) over the - specified CSS voice characteristics. The following list outlines the selection algorithm (note - that the definition of "language" is loose here, in order to cater for dialectic - variants):</p> + <p>The 'voice-family' property is used to guide the selection of the speech synthesis voice + instance. As part of this selection process, speech-capable user agents must also take into + account the language of the selected element within the markup content. The "name", "gender", + "age", and preferred "variant" (index) are voice selection hints that get carried down the + content hierarchy as the 'voice-family' property value gets inherited by descendant elements. + At any point within the content structure, the language takes precedence (i.e. has a higher + priority) over the specified CSS voice characteristics. </p> + <p> The following list outlines the voice selection algorithm (note that the definition of + "language" is loose here, in order to cater for dialectic variants):</p> <ol> - <li> If only a single voice is available for the language of the selected content, then this - voice must be used, regardless of the specified CSS voice characteristics. </li> - <li> If several voices are available for the language of the selected content, then the chosen - voice is the one that most closely matches the specified gender, age, and preferred voice - variant. The actual definition of "best match" is processor-dependent.</li> + <li> If only a single voice instance is available for the language of the selected content, + then this voice must be used, regardless of the specified CSS voice characteristics. </li> + <li> If several voice instances are available for the language of the selected content, then + the chosen voice is the one that most closely matches the specified name, or gender, age, + and preferred voice variant. The actual definition of "best match" is processor-dependent + (e.g. a reasonable match for "voice-family: 10 male;" may well be a higher pitched female + voice suitable for a young boy's vocal rendition). If no voice instance matches the + characteristics provided by any of the 'voice-family' component values, the first available + voice instance (amongst those suitable for the language of the selected content) must be + used. </li> <li> If no voice is available for the language of the selected content, it is recommended that user-agents let the user know about the lack of appropriate TTS voice. </li> </ol>
Received on Wednesday, 6 July 2011 19:47:28 UTC