Re: H2 Server Push performance data from Tom Bergan on 2019-07-23 (ietf-http-wg@w3.org from July to September 2019)

From: Tom Bergan <tombergan@chromium.org>
Date: Mon, 22 Jul 2019 17:17:31 -0700
To: "Nanner, Aman" <ananner@akamai.com>
Cc: "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Message-ID: <CA+3+x5HWR4SytJeKcqHD8Q=EN84854BuyFw6SRx7RKx1QJRyZw@mail.gmail.com>
Did you at least do an A/A comparison to test if your regression introduces
bias?

On Mon, Jul 22, 2019 at 2:27 PM Nanner, Aman <ananner@akamai.com> wrote:

> Hi Tom,
>
> Thanks for your feedback!
>
> The first couple of charts (based on the multiple customer sites) use the
> mean as a way to provide a summary view of performance for each individual
> site.  The last chart in the article is a percentile plot for one customer
> site which demonstrates a more granular view, but we do not have percentile
> plots put together for each customer site. It should be possible to do that
> in the event we were to perform deeper analysis on more sites.
>
> The A/B data was not collected in a way that we would consider to be pure
> enough to be considered i.i.d coin flips. There exist dimensions where the
> values can make it more or less likely that Server Push may be applied on
> specific responses within the production network (e.g. geo, time of day,
> user-agent), and so the approach was chosen to use linear regression to
> attempt to control for these variables.
>
> I agree that the FirstContentfulPaint metric would be useful to analyze;
> we had some limitations in our dataset regarding collection of that metric
> that resulted in it not being reliable enough to use (it wasn't as widely
> collected as DOMComplete). There have been some encouraging results seen
> for this metric based on some preliminary analysis, but we wouldn't
> consider the data and analysis reliable enough yet for sharing.
>
> Thanks,
> Aman
>
>
> ------------------------------
> *From:* Tom Bergan <tombergan@chromium.org>
> *Sent:* Monday, July 22, 2019 2:02 PM
> *To:* Nanner, Aman
> *Cc:* ietf-http-wg@w3.org
> *Subject:* Re: H2 Server Push performance data
>
> Hi Aman,
>
> Interesting results, thanks for sharing! I do wonder why you decided to
> compare means instead of percentiles? It looks like there are some crazy
> bad outliers. Assuming your "April 2019 Results with Old Methodology" and
> "April 2019 Results (1% quantile excluded)" graphs use the same set of
> sites in the same order, the mean goes from about +500ms to -200ms. If 99%
> of requests on that site take -200ms, the last 1% must take 70s! Instead of
> means, you could have compared medians, or 25th/75th percentiles, or other
> percentiles in the distribution. You can compute confidence intervals on
> percentiles. You can also do something like Mann-Whitney to check if the
> distributions differ significantly.
>
> I'd also like to hear more about this linear regression that you run on
> the A/B results. I couldn't follow why this regression is necessary. If
> your A/B test uses i.i.d. coin flips, you can compute confidence intervals
> directly on the A/B results. Is your A/B test not i.i.d.? How do you know
> the linear regression doesn't introduce bias?
>
> Also, have you looked into measuring other metrics, such as
> FirstContentfulPaint?
>
> -Tom
>
> On Mon, Jul 22, 2019 at 4:41 AM Nanner, Aman <ananner@akamai.com> wrote:
>
>> At the previous IETF 102 HTTPWG session in Montreal, I had presented some
>> Akamai data on H2 Server Push which can be found here:
>>
>>
>> https://github.com/httpwg/wg-materials/blob/gh-pages/ietf102/akamai-server-push.pdf
>> <https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_httpwg_wg-2Dmaterials_blob_gh-2Dpages_ietf102_akamai-2Dserver-2Dpush.pdf&d=DwMFaQ&c=96ZbZZcaMF4w0F4jpN6LZg&r=HVAbQ4bnkmcYKMR6_H0nEMfpVIpY1jgJWVBhC_es5Nw&m=JhnrDdJ2FDK0-S_OawS0GoKyXIVn4C8H_c03AveZDRo&s=UzEaCDDavuWPtu3zi_hdJa51YwKMwdzescfn9rpYnlQ&e=>
>>
>> Akamai has conducted some more recent tests with a tweaked methodology
>> (exclusion of 1% highest-latency requests on the long-tail), and we have
>> found some interesting results. I share some more details about the
>> performance analysis here:
>>
>>
>> https://medium.com/@ananner/http-2-server-push-performance-a-further-akamai-case-study-7a17573a3317
>> <https://urldefense.proofpoint.com/v2/url?u=https-3A__medium.com_-40ananner_http-2D2-2Dserver-2Dpush-2Dperformance-2Da-2Dfurther-2Dakamai-2Dcase-2Dstudy-2D7a17573a3317&d=DwMFaQ&c=96ZbZZcaMF4w0F4jpN6LZg&r=HVAbQ4bnkmcYKMR6_H0nEMfpVIpY1jgJWVBhC_es5Nw&m=JhnrDdJ2FDK0-S_OawS0GoKyXIVn4C8H_c03AveZDRo&s=5ScQ7t8fvPbCcHySFH1WDh9iHpnjchASXvAmdo8mvJM&e=>
>>
>> Thanks,
>> Aman Nanner
>> Akamai Technologies Inc.
>>
>>
Received on Tuesday, 23 July 2019 00:18:08 UTC