- From: Deborah Dahl <Dahl@conversational-Technologies.com>
- Date: Wed, 27 Oct 2021 14:39:25 -0400
- To: <public-voiceinteraction@w3.org>
https://www.w3.org/2021/10/27-voiceinteraction-minutes.html
and below as text.
[1]W3C
[1] https://www.w3.org/
- DRAFT -
voice interaction
27 October 2021
[2]IRC log.
[2] https://www.w3.org/2021/10/27-voiceinteraction-irc
Attendees
Present
bev, debbie, dirk, kazuyuki, mustaq ahmed, paul grenier
Regrets
-
Chair
Debbie
Scribe
ddahl
Contents
1. [3]Breakout feedback and expected workshop
2. [4]Architecture document
Meeting minutes
Breakout feedback and expected workshop
<PaulG_> [5]https://www.w3.org/TR/spoken-html/
[5] https://www.w3.org/TR/spoken-html/
[6]https://lists.w3.org/Archives/Public/public-voiceinteraction/2021Oct/0012.html
[6] https://lists.w3.org/Archives/Public/public-voiceinteraction/2021Oct/0012.html
debbie: review discussion from last week's breakout groups
[7]https://web-eur.cvent.com/event/2b77fe3d-2536-467d-b71b-969b2e6419b5/websitePage:efc4b117-4ea4-4be5-97b4-c521ce3a06db
[7] https://web-eur.cvent.com/event/2b77fe3d-2536-467d-b71b-969b2e6419b5/websitePage:efc4b117-4ea4-4be5-97b4-c521ce3a06db
<kaz> [8]https://www.w3.org/2021/10/20-voice-minutes.html
[8] https://www.w3.org/2021/10/20-voice-minutes.html
<kaz> [9]https://www.w3.org/2021/10/19-voice-minutes.html
[9] https://www.w3.org/2021/10/19-voice-minutes.html
debbie: possibility of a voice workshop
kaz: how to integrate speech API and SSML in a workshop
. organized session with voice interoperability session
kaz: decided to have a workshop, not a voice workshop but a
smart agent workshop
. interoperability, voice interface, accessibility
. some overlap with the semantic web when we talk about smart
agents; is that too broad?
. one or two days, online
kaz: online workshop is much easier
<Bev> Perhaps hybrid online and in person?
kaz: usually takes six months or so, around May
<Bev> Include the Cognitive Inclusion COGA group
bev: could also do a hybrid event
. cognitive inclusion group has some overlap
<Bev> Information Architecture Community Group is also
supportive and can participate
kaz: should have a dedicated session on accessibility
debbie: to attend, participants need to prepare a position
paper, which the program committee will review
<Bev> anyone interested can prepare a position paper to submit
to the program committee
<kaz> [10]e.g., Smart Cities Workshop CfP
[10] https://www.w3.org/2021/06/smartcities-workshop/index.html
debbie: prerecorded videos with captions need to be provided
debbie: other topics like Open Voice Network could be included
paul: disambiguation in Spoken HTML spec, machine learning has
its own heuristics, but in the meantime author-controlled
pronunciation would be useful
paul: trying to get feedback from implementers, can't just
bring SSML into HTML
. will have some representation of SSML in HTML, especially
pronunciation
. could use this in machine learning
paul: word clusters could be modified by IPA
. a layer could map pronunciation to IPA
. and match to user's intent
. language, cultural information is missing
. when input happens, a speech difficulty is like a transform
over the standard language
. we can transform from word or from sound
. e.g. the user could have had a stroke or something else that
altered their speech
bev: iPads for elderly after dental surgery
. speech was different
. could we use this to transform speech
paul: for Spoken HTML this is the first step
. if the system doesn't find a match it could look for
transforms
. could be useful in a kiosk situation where user can't add
their preferences
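
The fallback paul describes (try a direct pronunciation match first, then look for transforms) might be sketched roughly as follows. This is only an illustration and not part of the Spoken HTML draft: the lexicon, the transform list, and the function name `match_word` are all hypothetical, and a real recognizer would work on acoustic models rather than exact IPA strings.

```python
# Hypothetical lexicon: IPA pronunciation -> word (assumed for illustration).
LEXICON = {
    "θɪŋk": "think",
    "sɪŋk": "sink",
    "ɹɛd": "red",
}

# Hypothetical transforms: (sound as heard, sound it may stand for).
TRANSFORMS = [
    ("f", "θ"),  # "fink" heard for "think"
    ("w", "ɹ"),  # "wed" heard for "red"
]


def match_word(heard_ipa):
    """Return (word, transform_used); (None, None) if nothing matches."""
    # Step 1: direct lookup of the pronunciation as heard.
    if heard_ipa in LEXICON:
        return LEXICON[heard_ipa], None
    # Step 2: no match, so try each transform and look up the result.
    for heard, intended in TRANSFORMS:
        candidate = heard_ipa.replace(heard, intended)
        if candidate != heard_ipa and candidate in LEXICON:
            return LEXICON[candidate], (heard, intended)
    return None, None
```

In a kiosk setting, where the user cannot store preferences, the transform list would have to be a general-purpose one shipped with the system rather than one learned per user.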
kaz: two points, one for speech synthesis and one for speech
recognition
. for speech output it would be nice to have another layer to
get correct pronunciation
<Bev> Kaz: acoustic model
kaz: for speech input, we might want to include another
mechanism
<Bev> Kaz: command input expected actions, speech and gesture
kaz: such as hardware switch, gesture
debbie: also Natural Language Interfaces spec
<kaz> kaz: btw, it would be really nice if you all by chance
could join the Program Committee for the expected workshop :)
debbie: can join the program committee
paul: maybe could join
bev: could join program committee
. depends on timing
Architecture document
architecture document [11]https://w3c.github.io/voiceinteraction/voice%20interaction%20drafts/paArchitecture-1-2.htm
[11] https://w3c.github.io/voiceinteraction/voice%20interaction%20drafts/paArchitecture-1-2.htm
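In the architecture document, IPA means "intelligent personal
assistant" (not the International Phonetic Alphabet, as in the
pronunciation discussion above)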
dirk: (reviews input architecture)
. provider selection strategies can be used to select providers
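
One way to picture a provider selection strategy is as a pluggable function that narrows a list of registered providers. The sketch below is an assumption made for illustration only; the `Provider` class, its `domains` field, and both strategy functions are hypothetical and are not taken from the architecture document.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Provider:
    name: str
    domains: Tuple[str, ...]  # topics the provider claims to handle (assumed)


# A strategy takes the registered providers and a domain, and returns a subset.
Strategy = Callable[[List[Provider], str], List[Provider]]


def select_all(providers: List[Provider], domain: str) -> List[Provider]:
    """Ask every registered provider, regardless of domain."""
    return list(providers)


def select_by_domain(providers: List[Provider], domain: str) -> List[Provider]:
    """Ask only providers that declare the requested domain."""
    return [p for p in providers if domain in p.domains]


def select_providers(providers: List[Provider], domain: str,
                     strategy: Strategy) -> List[Provider]:
    """Apply whichever strategy the assistant is configured with."""
    return strategy(providers, domain)
```

Keeping the strategy as a parameter means the selection policy can change without touching the rest of the input path.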
dirk: (goes through output path)
bev: question about intent sets
. could you talk about that a little more
dirk: information that could be used to fill in slots
bev: is that a standard?
dirk: for now this is pretty abstract
bev: would that include security information
dirk: thinking in terms of SISR (Semantic Interpretation for
Speech Recognition), more like that
. have to distinguish between local intent sets and provider
intent sets
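
Since the intent set idea is still abstract, the following is only a rough sketch of the distinction dirk draws: an intent set lists the slots each intent needs, and a lookup checks the local set before the provider set. The intent names, slot names, and the function `fill_slots` are hypothetical.

```python
# Hypothetical intent sets: intent name -> names of the slots it needs.
LOCAL_INTENTS = {"set_volume": ["level"]}
PROVIDER_INTENTS = {"book_flight": ["origin", "destination", "date"]}


def fill_slots(intent_name, values):
    """Return (source, slots); slots with no value yet are left as None."""
    # Local intent sets are checked before provider intent sets.
    for source, intent_set in (("local", LOCAL_INTENTS),
                               ("provider", PROVIDER_INTENTS)):
        if intent_name in intent_set:
            slots = {slot: values.get(slot) for slot in intent_set[intent_name]}
            return source, slots
    # The intent is not known to either set.
    return None, {}
```

Slots left as `None` mark the information that still has to be collected from the user or the provider.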
debbie: EmotionML could be used in input and output
kaz: don't have any specific comments, should discuss with
browser and speech vendors
. should present at workshop
. EMMA would be a good format for all this data
kaz: would like to integrate MMI architecture and SCXML for
interaction management with WoT standards for device management
. the DID (Decentralized Identifiers) standard has many
implementers, is based on blockchain, and should be a
Recommendation soon
. that can be used to identify users and devices, also
discovery can be handled this way
debbie: next call will be November 10
Received on Wednesday, 27 October 2021 18:39:40 UTC