Specifying the audio buffer size from Charlie Kehoe on 2015-04-21 (public-media-capture@w3.org from April 2015)

From: Charlie Kehoe <ckehoe@google.com>
Date: Tue, 21 Apr 2015 00:32:04 +0000
To: public-media-capture@w3.org
Cc: Justin Uberti <juberti@google.com>, Rahul Chaturvedi <rkc@google.com>, Andrew Bunner <abunner@google.com>
Message-ID: <CAGNr40qz0moT7+O1BKOz_qh8TvfZf0JXxUbtb6jTYnNz=8yUpA@mail.gmail.com>

Some applications involve listening to audio for a potentially extended
period of time (with user consent, of course), and are not particularly
latency-sensitive. An example would be the "Ok Google" hotwording available
on the Chrome new tab page, or other types of continuous speech
recognition. For these applications, a typical low-latency audio
configuration can lead to excessive power usage. I've measured 20% CPU
usage for audio capture in Chrome, for example.

My proposed solution is to offer a way to change the audio buffer size.
This enables a tradeoff between latency and power usage. For example, a
member could be added to MediaTrackConstraintSet
<http://w3c.github.io/mediacapture-main/getusermedia.html#dictionary-mediatrackconstraintset-members>:

dictionary MediaTrackConstraintSet {
   ...
   ConstrainLong audioBufferDurationMs;
};

This would be an integer number of milliseconds. Perhaps the name could
mention latency instead (e.g. audioLatencyMs).
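
To make the tradeoff concrete, here is a rough sketch in TypeScript. It assumes a simplified model in which the capture pipeline wakes up once per buffer, so a longer buffer means fewer wakeups per second; actual power savings would depend on the platform. The member name audioBufferDurationMs is the proposed one and does not exist in any browser, and the helper names (callbacksPerSecond, framesPerBuffer, buildAudioConstraints) as well as the 48 kHz sample rate are made up for illustration.

```typescript
// Hypothetical shape: "audioBufferDurationMs" is only the name proposed in
// this thread, not an existing constraint.
interface ProposedAudioConstraints {
  audioBufferDurationMs: { ideal: number };
}

// Capture callbacks (wakeups) per second, assuming one callback per buffer.
function callbacksPerSecond(bufferMs: number): number {
  return 1000 / bufferMs;
}

// Number of audio frames held in one buffer at the given sample rate.
function framesPerBuffer(sampleRateHz: number, bufferMs: number): number {
  return Math.round((sampleRateHz * bufferMs) / 1000);
}

// Build the constraint object; the value is an integer number of milliseconds.
function buildAudioConstraints(bufferMs: number): ProposedAudioConstraints {
  if (Math.floor(bufferMs) !== bufferMs || bufferMs <= 0) {
    throw new RangeError("bufferMs must be a positive integer");
  }
  return { audioBufferDurationMs: { ideal: bufferMs } };
}

// Compare a low-latency setting with a relaxed one for hotword listening.
const lowLatencyMs = 10;
const relaxedMs = 100;
console.log(callbacksPerSecond(lowLatencyMs), framesPerBuffer(48000, lowLatencyMs)); // 100 480
console.log(callbacksPerSecond(relaxedMs), framesPerBuffer(48000, relaxedMs));       // 10 4800

// If a browser implemented the proposal, the object might be passed as:
//   navigator.mediaDevices.getUserMedia({ audio: buildAudioConstraints(100) })
```

Using "ideal" rather than "exact" here is a deliberate choice in the sketch: it would let the browser fall back to whatever buffer size the hardware supports instead of failing the request.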

How does this simple change sound?

- Charlie

Received on Tuesday, 21 April 2015 08:36:14 UTC