[EMOXG] xml spec links2 position on a timeline

Hi all,
I just have the idea that it might be useful, with text-to-speech
synthesis in mind, to be able to express emotion state duration not only
with respect to time or sample values but also as a percentage.
Example for an speech synthesizer going from sadness to anger (embedded
in SSML):

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis"
xml:lang="en-US">
  <voice gender="female"> 
    <prosody contour="(0%,+20Hz)(10%,+30%)(40%,+10Hz)">
       Hi, am sad know but start getting angry...
    </prosody>
  </voice>
  <emotion> 
   <category name="sadness"/>
   <timing start="10%" end="50%"/>
  </emotion>
  <emotion> 
   <category name="anger"/>
   <timing start="50%" end="100%"/>
  </emotion>
</speak>


Otherwise it would be difficult to specifiy a progression when absolute
duration is unknown. 

Regards,
felix

Received on Wednesday, 20 August 2008 15:19:11 UTC