[voiceinteraction] binary information in XML

On the last call I took an action to look into carrying binary data in XML, so that audio (for example, a WAV file) can be sent
in an EMMA document. There doesn't seem to be a single standard way to do this, but there are several options. One is to encode
the audio as text using Base64. An EMMA document carrying audio could then look like the following:
<emma:emma version="2.0" xmlns:emma="http://www.w3.org/2003/04/emma">
    <emma:interpretation id="1"
        emma:dialog-turn="dialog-turn1"
        emma:start="1641924463040"
        emma:end="1641924466040"
        emma:medium="acoustic"
        emma:mode="voice"
        emma:function="dialog"
        emma:verbal="true">
        <signal encoding="Base64">
            UklGRoSdAgBXQVZFZm10IBAAAAABAAEAgD4AAAB9AAACABAAZGF0YWCdAgDz/wMAAwD4/xYA9v/4/+f/8v8HAA8ADQADAPv...
        </signal>
    </emma:interpretation>
</emma:emma>
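
As a rough illustration of how such a document might be produced and read back, here is a minimal Python sketch using only the standard library. The function names (build_emma_audio, extract_audio) are hypothetical, and declaring the EMMA namespace URI and qualifying the elements with it is my assumption rather than something settled above. The <signal encoding="Base64"> element is just the one proposed in the example.

```python
import base64
import xml.etree.ElementTree as ET

# Assumption: the EMMA namespace URI from the W3C EMMA specification.
EMMA_NS = "http://www.w3.org/2003/04/emma"
ET.register_namespace("emma", EMMA_NS)


def emma(name: str) -> str:
    """Return a namespace-qualified name in ElementTree's {uri}local form."""
    return f"{{{EMMA_NS}}}{name}"


def build_emma_audio(audio: bytes, start_ms: int, end_ms: int) -> str:
    """Wrap raw audio bytes in an EMMA document as Base64 text (hypothetical helper)."""
    root = ET.Element(emma("emma"), {"version": "2.0"})
    interpretation = ET.SubElement(root, emma("interpretation"), {
        "id": "1",
        emma("start"): str(start_ms),
        emma("end"): str(end_ms),
        emma("medium"): "acoustic",
        emma("mode"): "voice",
    })
    # The <signal> element is the application-specific element proposed above.
    signal = ET.SubElement(interpretation, "signal", {"encoding": "Base64"})
    signal.text = base64.b64encode(audio).decode("ascii")
    return ET.tostring(root, encoding="unicode")


def extract_audio(document: str) -> bytes:
    """Recover the original audio bytes from a document built above."""
    root = ET.fromstring(document)
    signal = root.find(f"{emma('interpretation')}/signal")
    if signal is None or signal.text is None:
        raise ValueError("no <signal> payload found")
    # b64decode discards surrounding whitespace, so pretty-printed XML still works.
    return base64.b64decode(signal.text)
```

Base64 expands the payload by roughly a third, which may matter for longer recordings.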

To send the audio incrementally, we could send multiple EMMA documents, each carrying one part of the audio.
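
A minimal sketch of the incremental idea, under the assumption that each part is Base64-encoded independently and the receiver concatenates the decoded bytes in order. The function names and the chunk size are hypothetical, and how parts would be ordered or identified across EMMA documents is left open here.

```python
import base64
from typing import Iterable, Iterator


def iter_base64_chunks(audio: bytes, chunk_size: int = 32000) -> Iterator[str]:
    """Yield independently Base64-encoded slices of the audio, in order."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for offset in range(0, len(audio), chunk_size):
        chunk = audio[offset:offset + chunk_size]
        yield base64.b64encode(chunk).decode("ascii")


def reassemble(payloads: Iterable[str]) -> bytes:
    """Decode each payload separately and join the bytes in arrival order."""
    return b"".join(base64.b64decode(payload) for payload in payloads)
```

Each payload would go into the <signal> element of its own EMMA document. For a WAV file, only the first part contains the RIFF header, so the receiver would need to reassemble the parts (or otherwise know the audio format) before decoding the audio.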

Received on Tuesday, 11 January 2022 20:50:22 UTC