[voiceinteraction] binary information in XML from Deborah Dahl on 2022-01-11 (public-voiceinteraction@w3.org from January 2022)

From: Deborah Dahl <Dahl@conversational-Technologies.com>
Date: Tue, 11 Jan 2022 15:50:04 -0500
To: <public-voiceinteraction@w3.org>
Message-ID: <074201d8072c$cf96ed80$6ec4c880$@conversational-Technologies.com>

I had an action from the last call to look into how binary data could be carried in XML, so that audio (such as a WAV file) can be
sent in an EMMA document. There doesn't seem to be a single standard way to do it, but there are several options. One is to encode
the data as text using Base64. An EMMA document carrying audio could then look like the following:
<emma:emma version="2.0" xmlns:emma="http://www.w3.org/2003/04/emma">
    <emma:interpretation id="1"
            emma:dialog-turn="dialog-turn1"
            emma:start="1641924463040"
            emma:end="1641924466040"
            emma:medium="acoustic"
            emma:mode="voice"
            emma:function="dialog"
            emma:verbal="true">
        <signal encoding="Base64">
            UklGRoSdAgBXQVZFZm10IBAAAAABAAEAgD4AAAB9AAACABAAZGF0YWCdAgDz/wMAAwD4/xYA9v/4/+f/8v8HAA8ADQADAPv...
        </signal>
    </emma:interpretation>
</emma:emma>
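
As a rough illustration of the Base64 approach, here is a minimal Python sketch that encodes raw audio bytes, wraps them in a document shaped like the example above, and decodes them again on the receiving side. The helper names build_emma_document and extract_audio are hypothetical, and the <signal> element is only the proposal from this message, not something defined by EMMA.

```python
import base64
import xml.etree.ElementTree as ET

# EMMA namespace URI; the helper names and any sample bytes are illustrative only.
EMMA_NS = "http://www.w3.org/2003/04/emma"
ET.register_namespace("emma", EMMA_NS)


def build_emma_document(audio: bytes, start_ms: int, end_ms: int) -> str:
    """Wrap Base64-encoded audio in an EMMA-style document (hypothetical helper)."""
    root = ET.Element(f"{{{EMMA_NS}}}emma", {"version": "2.0"})
    interp = ET.SubElement(
        root,
        f"{{{EMMA_NS}}}interpretation",
        {
            "id": "1",
            f"{{{EMMA_NS}}}start": str(start_ms),
            f"{{{EMMA_NS}}}end": str(end_ms),
            f"{{{EMMA_NS}}}medium": "acoustic",
            f"{{{EMMA_NS}}}mode": "voice",
        },
    )
    # <signal> is the element proposed in this message, not a standard EMMA element.
    signal = ET.SubElement(interp, "signal", {"encoding": "Base64"})
    signal.text = base64.b64encode(audio).decode("ascii")
    return ET.tostring(root, encoding="unicode")


def extract_audio(document: str) -> bytes:
    """Recover the original bytes from the <signal> element."""
    root = ET.fromstring(document)
    signal = root.find(f"{{{EMMA_NS}}}interpretation/signal")
    # b64decode ignores surrounding whitespace, so pretty-printed payloads also work.
    return base64.b64decode(signal.text)
```

One thing to keep in mind with this approach is size: Base64 produces 4 characters for every 3 bytes of input, so the encoded audio is about a third larger than the original.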

To send the audio incrementally, we could send a series of EMMA documents, each containing one part of the audio.
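
The incremental idea could be sketched roughly as follows in Python: the audio is split into fixed-size slices, each slice is Base64-encoded into its own document, and the receiver decodes and concatenates them. The chunk size, the function names, and the seq attribute used for ordering are all assumptions made for illustration; none of them comes from EMMA or from the proposal above.

```python
import base64
import xml.etree.ElementTree as ET
from typing import Iterable, Iterator

CHUNK_SIZE = 3000  # bytes per document; an arbitrary value, chosen as a multiple of 3


def chunk_audio(audio: bytes, chunk_size: int = CHUNK_SIZE) -> Iterator[str]:
    """Yield one EMMA-style document per slice of the audio."""
    for seq, offset in enumerate(range(0, len(audio), chunk_size)):
        payload = base64.b64encode(audio[offset:offset + chunk_size]).decode("ascii")
        # "seq" is a hypothetical ordering attribute, not part of EMMA.
        yield (
            '<emma:emma version="2.0" xmlns:emma="http://www.w3.org/2003/04/emma">'
            f'<emma:interpretation id="{seq + 1}" emma:medium="acoustic" emma:mode="voice">'
            f'<signal encoding="Base64" seq="{seq}">{payload}</signal>'
            "</emma:interpretation></emma:emma>"
        )


def reassemble(documents: Iterable[str]) -> bytes:
    """Decode each document's payload and join the parts in seq order."""
    parts = []
    for doc in documents:
        signal = ET.fromstring(doc).find(".//signal")
        parts.append((int(signal.get("seq")), base64.b64decode(signal.text)))
    return b"".join(data for _, data in sorted(parts))
```

Each slice is encoded on its own, so a receiver can decode a document as soon as it arrives. Using a chunk size that is a multiple of 3 keeps padding characters out of every document except possibly the last.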

Received on Tuesday, 11 January 2022 20:50:22 UTC