- From: Greg Wilkins <gregw@intalio.com>
- Date: Fri, 10 Oct 2014 11:12:08 +1100
- To: Chad Austin <caustin@gmail.com>
- Cc: Mike Bishop <Michael.Bishop@microsoft.com>, Martin Thomson <martin.thomson@gmail.com>, Amos Jeffries <squid3@treenet.co.nz>, HTTP Working Group <ietf-http-wg@w3.org>
- Message-ID: <CAH_y2NFhyVTEpPJemrfh5K69KuxTcM-XCBBA9qky3qrUpNhazg@mail.gmail.com>
On 10 October 2014 05:35, Chad Austin <caustin@gmail.com> wrote:

> And, of course, the most interesting thing about this entire discussion is
> that managing a request priority queue is often very easy for both clients
> and servers.

Actually it can be very hard for a server, and there are some tough issues to deal with. Priority is going to be a very hard nut to crack on the server.

From a client's point of view it is mostly about optimising round trips and network utilisation, but a server has to consider overall throughput as well as keeping individual connections optimal. It is probably not sensible for a server to throw all of its cores at a single connection to optimally drive network utilisation for 1 client when it may have 99,999+ other client connections to consider. Thus I believe that servers are going to have to limit the degree of parallelism they are prepared to allocate to a single connection, otherwise HTTP/2 will have very bad mechanical sympathy.

So let's say that we have a queue of 30 requests in priority order. A server may wish to limit the parallelism that it gives to this, so it may allocate threads only to the first few requests, and it makes sense to pick them in priority order. But what if those high priority requests are slow to generate? Should the server still wait for them to complete even though the network connection for that client is idle, when by allocating a thread to a lower priority resource it might be able to make progress?

This will of course be further complicated by push, where 29 of those requests are generated from push_promises based on information that the server knows about the base request, which could include time to generate, dependencies, and priorities assigned on previous pushes. A server might know that it has some low priority quick resources that it can dispatch and serve in the time it takes some high priority slow resources to be generated.
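To make the trade-off concrete, here is a minimal sketch of the kind of per-connection scheduling being described. All names (`ConnectionScheduler`, `Req`, `next`) are hypothetical, not Jetty or any real server's API: a parallelism cap per connection, dispatch in priority order by default, but fall back to a quick lower-priority request when the connection would otherwise sit idle behind slow high-priority work.

```java
import java.util.Comparator;
import java.util.PriorityQueue;

// Hypothetical sketch of per-connection request scheduling, assuming a
// "quick vs slow" hint is available per request (e.g. static vs generated).
class ConnectionScheduler {
    static final class Req {
        final String name;
        final int priority;   // lower value = higher priority
        final boolean slow;   // expected to be slow to generate?
        Req(String name, int priority, boolean slow) {
            this.name = name; this.priority = priority; this.slow = slow;
        }
    }

    private final int maxParallelism;  // threads this connection may consume
    private final PriorityQueue<Req> queue =
        new PriorityQueue<>(Comparator.comparingInt(r -> r.priority));
    private int active = 0;

    ConnectionScheduler(int maxParallelism) {
        this.maxParallelism = maxParallelism;
    }

    void submit(Req r) { queue.add(r); }

    // Pick the next request to dispatch, or null if the cap is reached or the
    // queue is empty. If the connection is idle and the top-priority request
    // is slow, pick some quick request instead (heap iteration order, not
    // strictly the best quick one) so the link is not left unused.
    Req next(boolean connectionIdle) {
        if (active >= maxParallelism || queue.isEmpty()) return null;
        Req pick = queue.peek();
        if (connectionIdle && pick.slow) {
            for (Req r : queue) {
                if (!r.slow) { pick = r; break; }
            }
        }
        queue.remove(pick);
        active++;
        return pick;
    }

    void complete() { active--; }  // call when a dispatched request finishes
}
```

With a cap of 2 and an idle connection, a quick low-priority `style.css` would be dispatched ahead of a slow high-priority `main.html`; with the connection busy, strict priority order applies. A real server would of course need the push_promise and dependency information discussed above to estimate "quick" vs "slow" at all.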
Static resources might be asynchronously pushed while generated resources may need a thread, further complicating the resource allocation.

cheers

--
Greg Wilkins <gregw@intalio.com> @ Webtide - *an Intalio subsidiary*
http://eclipse.org/jetty HTTP, SPDY, Websocket server and client that scales
http://www.webtide.com advice and support for jetty and cometd.
Received on Friday, 10 October 2014 00:12:37 UTC