Re: Backwards compatibility from Peter Lepeska on 2012-04-02 (ietf-http-wg@w3.org from April to June 2012)

From: Peter Lepeska <bizzbyster@gmail.com>
Date: Mon, 2 Apr 2012 13:56:39 -0400
To: ietf-http-wg@w3.org
Message-ID: <CANmPAYFTGKPTHOM+qnm+-G6yj8PajR88z-dgLMfB6UYjBOF9sQ@mail.gmail.com>
Big bites do seem to go down easier than lots of little ones. The problem
is that SPDY is eating *two* shit sandwiches, trying to make the web
both fast and secure, at the same time. This bite is more than most can
chew and so adoption will be much slower b/c of the SSL requirement, in my
opinion.

Peter

On Mon, Apr 2, 2012 at 1:31 PM, Mark Watson <watsonm@netflix.com> wrote:

>  All - the messages exchange below was supposed to be on-list - my
> mistake hitting reply instead of reply-all ...
>  On Apr 1, 2012, at 1:15 PM, Mike Belshe wrote:
>
>
>
> On Sat, Mar 31, 2012 at 9:47 AM, Mark Watson <watsonm@netflix.com> wrote:
>
>>  Mike, all,
>>
>>  This thread has gone into the weeds rather and is missing the point of
>> my original comment.
>>
>>  I did not intend a single throw-away paragraph to be a complete
>> technical
>> proposal.
>>
>>  My point was that deploying a new protocol at scale is hard. Look at
>> IPv6. It's not even mainly a technical problem. There are HTTP1.x-specific
>> assumptions throughout the network - people have paid money to put them
>> there, so presumably they have goals which would be undermined if large
>> amounts of traffic moved to a new protocol.
>>
>>  Whilst the fraction of HTTP1.x-compatible traffic stays close to its
>> current value you will not see deployment problems with new protocols. But
>> if you want to migrate large swathes of traffic to a new protocol, many
>> things have to be upgraded.
>>
>>  Before embarking on this, then, we should have a very firm idea of the
>> expected gains. Which means comparing with what can be achieved with a new
>> protocol to what can be achieved through simple extensions to the existing
>> one.
>>
>>  It seems to me, superficially, that several of the proposed
>> enhancements could be done this way.
>>
>>  It's true that there is a region where the difference between
>> 'extensions' and 'new protocol' is partly marketing. I'm not sure we should
>> go there. But it's also true there is a social engineering aspect to this
>> problem: people are often overly resistant to revolutionary changes and
>> prefer changes that appear evolutionary.
>>
>>  Having said all the above, it may be sufficient that there is
>> single-RTT fallback to HTTP1.1 in the presence of HTTP1.1 intermediaries.
>>
>
>  Heh - I think we're in more agreement than it might seem.
>
>  We had a philosophy when designing spdy:  "If you're going to eat a shit
> sandwich, take big bites".
>
>  What does that mean, you might ask?
>
>  Prior to starting SPDY, we had tried all sorts of incremental changes to
> HTTP - header compressors, data compressors, bundling, multiplexing, etc
> etc.  Some of these could be done with very small semantic changes to HTTP.
>  But, each of those semantic changes meant that every existing HTTP
> implementation out there (browsers, servers, or proxies) had to be made
> aware of the change and deal with appropriately...
>
>  In the end, the shitty part of changing HTTP is that changing the
> infrastructure is a ton of work (this is what you're rightly pointing out).
>  We knew we had several significant changes to make to HTTP.   Rather than
> doing them incrementally, and each one needing to figure out how to
> rechange the infrastructure, we decided taking one big bite is a preferred
> approach.  Solve all of these problems, but only change the infrastructure
> once.
>
>  I hope this metaphor isn't too off color and that it demonstrates the
> point.
>
>
>  MW: Sure. Fortunately I wasn't eating breakfast at the time ...
>
>
>
>  Regarding interleaved vs non-interleaved streams:  It sure seems easier
> to do what you're proposing, but I suspect that your proposal won't work.
>  For example, how would you do a comet-style hanging-GET without
> interleaved streams?
>
>
>  MW: I'm not familiar with exactly what that is, but I think the answer
> is use a separate connection.
>
>   This could be mitigated by opening up more parallel connections, but
> that is non-desirable too.
>
>
>  MW: I'm not really sure why. I can see that a parallel connections arms
> race is not a good idea - but we are all talking about things that reduce
> the need for parallel connections. Parallel connections are, also, a way to
> get a different overall congestion control behavior in a way that is
> reasonably safe.
>
>
>  BTW - did you mean to reply to all?
>
>
>  Yes, fixed.
>
>  Mike
>
>
>
>>
>>  ...Mark
>>
>>
>> Sent from my iPhone
>>
>> On Mar 30, 2012, at 9:18 PM, "Mike Belshe" <mike@belshe.com> wrote:
>>
>>
>>
>> On Sat, Mar 31, 2012 at 3:03 AM, Mark Watson <watsonm@netflix.com> wrote:
>>
>>>
>>>  On Mar 30, 2012, at 4:46 PM, Mike Belshe wrote:
>>>
>>>
>>>
>>> On Fri, Mar 30, 2012 at 6:53 PM, Mark Watson <watsonm@netflix.com>wrote:
>>>
>>>>
>>>>  On Mar 30, 2012, at 9:29 AM, William Chan (陈智昌) wrote:
>>>>
>>>> On Fri, Mar 30, 2012 at 6:13 PM, Mark Watson <watsonm@netflix.com>wrote:
>>>>
>>>>> All,
>>>>>
>>>>> I'd like to make a plea/request/suggestion that wherever possible new
>>>>> features be added incrementally to HTTP1.1, in a backwards compatible way,
>>>>> in preference to a "new protocol" approach. A "new protocol" is required
>>>>> only if it is not technically possible (or especially awkward) to add the
>>>>> feature in a backwards compatible way.
>>>>>
>>>>> The object should be to enable incremental implementation and
>>>>> deployment on a feature by feature basis, rather than all-or-nothing.
>>>>> HTTP1.1 has been rather successful and there is an immense quantity of code
>>>>> and systems - including intermediaries of various sorts - that work well
>>>>> with HTTP1.1. It should be possible to add features to that code and those
>>>>> systems without forklifting substantial amounts of it. It is better if
>>>>> intermediaries that do not support the new features cause fallback to
>>>>> HTTP1.1 vs the alternative of just blocking the new protocol. In
>>>>> particular, it should not cost a round trip to fall back to HTTP1.1. It is
>>>>> often lamented that the Internet is now the "port-80 network", but at least
>>>>> it is that.
>>>>>
>>>>
>>>>  Don't forget port 443. And I agree, it should not cost a round trip
>>>> to fallback to HTTP/1.1.
>>>>
>>>>
>>>>>
>>>>> Many of the features contemplated as solutions to the problems of
>>>>> HTTP1.1 can be implemented this way: avoiding head-of-line blocking of
>>>>> responses just requires a request id that is dropped by intermediaries that
>>>>> don't support it and echoed on responses. Request and response header
>>>>> compression can be negotiated - again with a request flag that is just
>>>>> dropped by unsupporting intermediaries. Pipelined requests could be
>>>>> canceled with a new method. These things are responsible for most of the
>>>>> speed improvements of SPDY, I believe.
>>>>>
>>>>
>>>>  It's unclear to me how this would work. Are you suggesting waiting a
>>>> HTTP request/response pair to figure out if the id gets echoed, before
>>>> trying to multiplex requests? Or would you rely on HTTP pipelining as a
>>>> fallback if the ids don't get echoed?
>>>>
>>>>
>>>>  Send the requests (yes, pipelined). If they come back without ids,
>>>> then they are coming back in the order they were sent. If they come back
>>>> with ids, then that tells you which response is which.
>>>>
>>>
>>>  You can't do this until you've got confirmation that the server is
>>> going to give you a HTTP/1.1 response.  It could come back HTTP/1.0.
>>>
>>>  So do we first have to do a 1.1 request successfully (with 1.1
>>> response) before we can ever attempt to do a pipelined upgrade?
>>>
>>>
>>>  For each server, yes. Servers don't often get downgraded from 1.1 to
>>> 1.0, so you could cache that result for quite a while.
>>>
>>>
>>>
>>>>
>>>>  The former incurs a large latency cost. The latter depends very much
>>>> on how deployable you view pipelining on the overall internet.
>>>>
>>>>
>>>>  It's certainly widely deployed in servers and non-transparent
>>>> proxies. Non-supporting non-transparent proxies are easily detected. Yes,
>>>> broken transparent proxies are a (small) problem, but you can also detect
>>>> these.
>>>>
>>>>  I am skeptical it is sufficiently deployable and we on Chromium are
>>>> gathering numbers to answer this question (http://crbug.com/110794).
>>>>
>>>>
>>>>  Our internal figures suggest that more than 95% of users can
>>>> successfully use pipelining. That's an average. On some ISPs the figure is
>>>> much lower.
>>>>
>>>
>>>  Do you a research result to cite here?  Sounds interesting.  5%
>>> failures is pretty high.
>>>
>>>
>>>  No, these are just internal figures right now. Yes, it does seem high,
>>> I've a feeling many of those are false negatives where we avoid pipelining
>>> unnecessarily.
>>>
>>>
>>>
>>>
>>>>
>>>>  Also, pipelining is clearly inferior to multiplexing.
>>>>
>>>>
>>>>  Yes, but perhaps in practice not by much. To render a page you need
>>>> all the objects, so from a time-to-page-load perspective it makes no
>>>> difference how you multiplex them, as long as the link remains fully
>>>> utilized. To see some difference you need some notion of object importance
>>>> and some metric for 'page loaded except for the unimportant bits'. You send
>>>> the most important requests first. Even then it's not clear that
>>>> multiplexing within objects will perform significantly better than object
>>>> by object sending.
>>>>
>>>
>>>
>>>  Don't forget that pipelining does *not* apply to all resources.  Even
>>> when pipelining works end-to-end, browsers need to take great care not to
>>> accidentally pipeline a critical resource behind a slow one (like a hanging
>>> GET).  This leads to browsers doing tricks like "only pipeline images
>>> together" or other subsets of pipelining.
>>>
>>>
>>>  I was assuming you could avoid the head-of-line blocking with an
>>> extension that allows out-of-order responses.
>>>
>>>
>>>  But when we consider pipelining a fallback for SPDY, this all falls
>>> apart.  SPDY did not have these restrictions.  So now, SPDY would need to
>>> run in some sort of degraded mode for what types of requests are pipelined
>>> just so fallback to a HTTP/1.1 protocol that the server might not support
>>> (because it could be HTTP/1.0) or which the user might not support because
>>> he's one of the unlucky 5% (according to Mark's data) where pipelining just
>>> breaks altogether.
>>>
>>>  All in all, we've now compounded 3 unique restrictions on the initial
>>> set of requests in order to work around past bugs in order to support use
>>> of the Upgrade header.
>>>
>>>  Realistically, you're going to get one request on the upgrade, and
>>> you'll have to wait to open up the parallel requests.  This is a
>>> significant restriction of the Upgrade process - it requires a round trip
>>> to kick into the real protocol at full gear.
>>>
>>>  This is highly annoying, but for initial web page loads, probably
>>> won't be a significant burden because the browser initially only has one
>>> URL.  For page reloads, or validations, or subsequent pages on reconnect,
>>> it will be a performance hit.
>>>
>>>
>>>
>>>
>>>>
>>>>
>>>>
>>>>> Interleaving within responses does require some kind of framing layer,
>>>>> but I'd like to learn why anything more complex than interleaving the
>>>>> existing chunked-transfer chunks is needed (this is also especially easy to
>>>>> undo).
>>>>>
>>>>
>>>>  Sorry, I'm not sure I understand what you mean by interleaving
>>>> existing chunked-transfer chunks. Are these being interleaved across
>>>> different responses (that requires framing, right?).
>>>>
>>>>
>>>>  Interleaving data from multiple responses requires some kind of
>>>> framing, yes. Chunked transfer encoding is a kind of framing that is
>>>> already supported by HTTP. Allowing chunks to be associated with different
>>>> responses would be a simple change. Maybe it feels like a hack ? That was
>>>> my question: why isn't a small enhancement to the existing framing
>>>> sufficient ?
>>>>
>>>>
>>>  Even if you could hack it into a chunk, thats a real jumbled mess.
>>>  Why do you want to do this?  It doesn't give you backward compatibility in
>>> any way (existing browsers won't know what to do with these nonstandard
>>> chunks anyway), its just a mess for the sake of a mess?
>>>
>>>
>>>  So, your answer to my question is fairly clear ;-)
>>>
>>>  It doesn't feel like such a 'mess' to me - we're talking about
>>> negotiating use of new protocol elements. They're only used if both ends
>>> support them so, yes, the only kind of backwards compatibility is that the
>>> use of framing is negotiated, rather than assumed from the start. My point
>>> was that you don't need a whole shim layer to do this, because HTTP already
>>> has framing. Perhaps it makes little difference, but it means you can
>>> develop and deploy functionality incrementally, rather than all-or-nothing.
>>>
>>
>>  Your approach is just out-of-order pipelining, right?  It's not an
>> interleaved multiplexing system.  And you're right, you don't necessarily
>> need a full framing layer to support that.  (unless you want flow control,
>> which you probably do, but haven't considered yet)
>>
>>  We can do a lot better than that, thats all.
>>
>>  BTW - more than one implementor has come to me and said, "wow - spdy
>> framing was really easy to implement".  It's not like the framing layer is
>> a hard concept.
>>
>>  I guess overall - I'm just not sure what your goals are.  You seem to
>> want it to look like HTTP even though it won't be HTTP and even though you
>> sacrificed a key part of the performance.  But what is the point of that?
>>  You're no longer trying to make it as fast as you can, so who is your
>> target market?
>>
>> Mike
>>
>>
>>
>>
>>>
>>>
>>>>
>>>>>
>>>>> Putting my question another way, what is the desired new feature that
>>>>> really *requires* that we break backwards compatibility with the extremely
>>>>> successful HTTP1.1 ?
>>>>>
>>>>
>>>>  Multiplexing,
>>>>
>>>>
>>>>  See my question above
>>>>
>>>>  header compression,
>>>>
>>>>
>>>>  Easily negotiated: an indicator in the first request indicates that
>>>> the client supports it. If that indicator survives to the server, the
>>>> server can start compressing response headers right away. If the client
>>>> receives a compressed response it can start compressing future requests on
>>>> that connection. It's important that this indicator be one which is dropped
>>>> by intermediaries that don't support compression.
>>>>
>>>>  prioritization.
>>>>
>>>>
>>>>  I think you mean "re-priortization". I can send requests in priority
>>>> order - what I can't do is change that order to response to user actions.
>>>> How big a deal is this, vs closing the connection and re-issuing
>>>> outstanding requests in the new order ?
>>>>
>>>
>>>  Its the difference between web pages rendering faster or slower.
>>>  Load up 100 image requests on your twitter page, and then fetch the images
>>> before the JS.  The page loads slower unless you lower the priority of the
>>> images.  But you still don't want to add serialization delays that HTTP has.
>>>
>>>  BTW - the effects of priorities has been measured, and you're welcome
>>> to use the existing benchmarking harness to verify yourself that these
>>> things are true in real code rather than just theory.  (see
>>> dev.chromium.org/spdy).  I wish I had published the tests when I did
>>> this long ago - spent a lot of time on it.
>>>
>>>
>>>  Again, I don't think you need anything more than the basic possibility
>>> to return responses out-of-order to get most of the gains.
>>>
>>
>>
>>
>>>  Send the requests in priority order and have the server return them in
>>> priority order, unless a response is not available in which case other
>>> responses can push ahead. The absence of interleaving within responses just
>>> reduces the granularity. Request the JS first, then the 100 images. With
>>> interleaving, if the JS is available half way through sending image 3, we
>>> can start sending the JS right there. Without interleaving you have to wait
>>> until the end of image 3.
>>>
>>>  What you don't have is, as I said, "re-prioritization", where the
>>> client can change its mind about the priority order after sending the
>>> requests - you'd have to close the connection and send the requests again.
>>>
>>>  Not perfect, but I feel you could get a good chunk of the gains, with
>>> out-of-order responses and negotiated compression.
>>>
>>>  Set aside that the significant advantages of small incremental changes
>>> to a well-understood, widely deployed, very successful protocol vs
>>> invention and all-at-once deployment of a new one.
>>>
>>>  …Mark
>>>
>>>
>>> Mike
>>>
>>>
>>>
>>>>
>>>>  …Mark
>>>>
>>>>
>>>>
>>>>>
>>>>> …Mark
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>
>>>
>>
>
>
Received on Monday, 2 April 2012 17:57:14 UTC