- From: Al Gilman <al.gilman@comcast.net>
- Date: Tue, 20 Jan 2004 13:46:22 -0500
- To: tvraman@almaden.ibm.com
- Cc: wai-xtech@w3.org
I don't know if you saw this question. It affects the definitions and processing of speech parameters. If we are going to baseline this on SSML in the future we should endeavor to get it right in SSML. In summary, the question has to do with the model or persistent form for relative pitch changes. SSML allows such pitch changes to be indicated in either linear-in-frequency Hz units or logarithmic-in-frequency st units. One would think that the same relative pitch change for different voices would be a change which is comparable in pitch (st) units rather than in frequency (Hz) units. But should this be true even if the author indicates the change in Hz? That would mean that the 'model' or persistent definition of relative pitch changes was always a frequency ratio. And that the ability to indicate this in Hz is an authoring convenience or macro, not a direct representation of the persistent property. Do you have an opinion? How important do you think this is? Al >Date: Tue, 20 Jan 2004 15:00:49 +0100 >From: "david.descamps-be" <david.descamps-be@laposte.net> >To: "www-voice" <www-voice@w3.org> >Subject: Relative Pitch change in Hz or st through voice change >X-Archived-At: >http://www.w3.org/mid/HRSK9D$BAA413E05AA48CE9A6803C9CF2DDF774@laposte.net > >quote: >"... >Relative changes in prosodic parameters should be carried across voice >changes. However, different voices have different natural defaults for >pitch, speaking rate, etc. because they represent different >personalities, so absolute values of the prosodic parameters may vary >across changes in the voice. >..." > >How the syntesis processor have to deal with relative pitch change in >Hertz or in semitone? > >/****** > >For example, you have a male voice (baseline pitch of : +/- 100Hz) with > >a relative change of 10Hz. You change the voice in a female one >(baseline pitch : +/- 180Hz) and you keep the relative pitch change: > >male : 100 -> 110 Hz : +10.0% >female : 180 -> 190 Hz : + 5.5% > >the proportion of your relative pitch change in Hertz have been >corrupted! >*******/ > >How the synthesis processor have to deal with relative change in Hz or >st through a voice change: keep the value or keep the proportion? > > >Yours, >David Descamps Accédez au courrier électronique de La Poste : >www.laposte.net ; 3615 LAPOSTENET (0,34€/mn) ; tél : 08 92 68 13 50 (0,34€/mn)
Received on Tuesday, 20 January 2004 13:47:11 UTC