- From: Roberto Peon <grmocg@gmail.com>
- Date: Tue, 23 Oct 2012 08:29:28 -0700
- To: Martin Nilsson <nilsson@opera.com>
- Cc: ietf-http-wg@w3.org
- Message-ID: <CAP+FsNfn2d1zCAPrDd=Wp2+cF448A=_MW9RzS1kP_iAP0rMPAw@mail.gmail.com>
What Martin said... with special emphasis on the difficulty of sanitizing the headers. :) The other complication is that we want to see what an entire interaction looks like (for compression), and not just the initial page load (unless that is how the users typically interact with that site) -=R On Oct 23, 2012 12:57 AM, "Martin Nilsson" <nilsson@opera.com> wrote: > On Mon, 22 Oct 2012 18:36:40 +0200, Vinayak Hegde <vinayakh@gmail.com> > wrote: > > >>> We are wondering whether anyone is aware of a test corpus for HTTP >>> exchanges that would be available, or could be made available. This would >>> help us obtaining fair and realistic results. >>> >> >> The internet archive stores the HTTP response as it is received and >> has a huge trove of results. However I don't know if it is available >> for download and replay. The arc file format is documented here - >> http://crawler.archive.org/**articles/developer_manual/**arcs.html<http://crawler.archive.org/articles/developer_manual/arcs.html> >> >> > As discussed previously, the headers in the HTTP request may be more > relevant to compress, since you want to fit as many requests as possible in > the initial TCP window. In the response the headers are many times dwarfed > by the size of the actual resource. Getting a representative sample for > test purposes quickly gets difficult for privacy reasons, in particular if > you capture mobile traffic, where unique identifiers like phone number > could be added to the requests by the network operator. > > /Martin Nilsson > > -- > Using Opera's revolutionary email client: http://www.opera.com/mail/ > >
Received on Tuesday, 23 October 2012 15:30:00 UTC