RE: [EME] reuse of session from Maruyama, Shinya on 2014-06-13 (public-html-media@w3.org from June 2014)

From: Maruyama, Shinya <Shinya.Maruyama@jp.sony.com>
Date: Fri, 13 Jun 2014 03:34:59 +0000
To: David Dorwin <ddorwin@google.com>
CC: "public-html-media@w3.org" <public-html-media@w3.org>
Message-ID: <E7496ED336482E459A47A005EAE8161204EAAB@JPYOKXMS121.jp.sony.com>

Does the KID array really help? The user agent would still need to (asynchronously) ask the CDM if it has each key ID. Even if it did, this only addresses a subset of content in one particular format.

I’m not sure why you think the user agent needs to ask the CDM.
Current model does not take care of “active session Initialization Data“ delivering what set of KIDs to CDM () or does not ensure the preceding session have completed successfully. It just relies on the same initData will result in the same license and don’t care the result of pending license (createSession is resolved with null even though the preceding session may fails to acquire the license).
KID-granular comparison is basically the same. The initData should result in the license delivering all the KIDs contained in the initData (maybe it delivers extra KIDs though). It just ensures that the KIDs listed in UA will be or have been made available to CDM. This is the same assumption which the current model relies on. If a preceding license delivers extra KIDs, unnecessary session may be created. However it is not worse than current model, either.
I don't see how it is better and thus why we should add special behavior for one format (CENC) or a dependency on CENC second edition. (Actually, WebM would work fine because the initData is a KID.)

If, for example, audio and video streams are encrypted with different keys, they will have different PSSH boxes with different KID values. If the first session results in a license for both keys, the application and user agent will not know this. Only the CDM knows that it already has keys for both KIDs. Thus, the user agent can't do anything with knowledge of the KIDs in the initData. It would still create multiple sessions because the KIDs are different just as the entire initData is different.

The case above is a bad practice we cannot de-dup licenses.

The one case where this might not be true is if there was fake initData (i.e. from a manifest) that contained both KIDs. Maybe this is what you are referring to below. However, this can probably be addressed by using a real PSSH box from one of the streams. If the license server is capable of returning a license for all KIDs based on a PSSH box containing just one of them, there is no reason to include all the KIDs in the manifest (if you are concerned about duplicate sessions in the key rotation case).

The best practice I mentioned below is the case KID comparison in user agents gives much help.
Actually, DASH-IF and common encryption 2nd edition are addressing the delivery of all the KIDs in the manifest.

The biggest advantage on introducing KID-granular comparison is that it helps to realize a best practice. For example, if manifest file delivers the pssh containing all KIDs for the presentation, the application can first call createSession with the pssh. Then, the subsequent media segment does not cause unnecessary sessions even though pssh is contained in moov or moof box.
Raw initData comparison cannot make it because pssh is different among manifest, moov and moof.
That's interesting - why are they different?

Is it because, as you said above, that the PSSH box from the manifest? What does the PSSH box in the moov contain? Why does the moov need a PSSH box?

Yes, the first pssh extracted from the manifest contains all the KID for the presentation because the manifest is a something to cover the entire streams.
Subsequent pssh may come from either moov or moof to support random access or trick play. Typically, as media segment contains a single track, those pssh’s are not the same unless the particular constraint is specified/operated like HbbTV restricting the Initialization Segment to being the common among all representations.

Received on Friday, 13 June 2014 03:36:46 UTC