- From: Jeffrey Mogul <mogul@pa.dec.com>
- Date: Tue, 18 Feb 97 15:20:20 PST
- To: Koen Holtman <koen@win.tue.nl>
- Cc: http-wg%cuckoo.hpl.hp.com@hplb.hpl.hp.com
On a micro level: I listed some (fixable) technical problems with the previous draft in http://www.ics.uci.edu/pub/ietf/http/hypermail/1996q4/0294.html As far as I can see, most of these problems have not been fixed in the new draft. Did the message above somehow drop out of the author's editorial queue? It is, in fact, still in my in-box, so I'll reply to that in my next message. Whether I would define these as "(fixable) technical problems" or "differences of opinion" is another question :-) 1) Section 4 says: `We believe that our design provides adequate support for user-counting, based on the following analysis.' I do not think it does (for a longer discussion, see the article linked above), and as long as this claim stays in, I won't support the draft. Many people want web metrics better than what have now, but this draft does not provide such metrics. Vendors say they get pressure for better metrics from their customers; I don't think implementing this draft will make the pressure go away. We can quibble about whether the design does indeed provide adequate support for counting users of a page. Perhaps the right statement would be We believe that our design provides adequate support for user-counting, within the constraints of what is feasible in the current Internet, based on the following analysis. As you stated yourself in the message cited above, I don't think any system can get you good user counts unless it includes per-user global browsing history logs of many megabytes. If by "good" you mean "100.00% accurate", then I'm inclined to agree with you, and since this is clearly infeasible, one has to settle for what is possible. We prefer to define "adequate" as "at least as accurate as is currently possible", given that cache-busting leads to over-counting in many cases. 2) I feel that there is too much unnecessary cruft in the draft. The usage limiting stuff should be removed, and the special rules for varying resources should probably also be removed. Some people seem to prefer hit-counting over usage-limiting; some people prefer the opposite. There is no clear consensus that one obviates the other. Since both seem (to us) to be best served by slight variations of a single basic mechanism, we believe that it is appropriate to include both in the proposal. As for section 8, "Interactions with varying resources": this simply states the bare minimum necessary to make sensible use of the Vary mechanism as it is currently defined in the HTTP/1.1 RFC. 3) To quote Roy Fielding: The other harm I mentioned is the implicit suggestion that "hit-metering" should be sanctioned by the IETF. It should not. Hit metering is a way for people who don't understand statistical sampling to bog down all requests instead of just those few requests needed to get a representative sample. Whether or not some ISP customers want it does not change the fact that it is damaging to the community as a whole, and it's a lot better to inform people on how not to be a "scum sucking pig" than it is to have a proposed standard on slightly-less piggish ways to be a pig. I feel that the IETF should not sanction this form of hit metering (by making it a proposed standard) _unless_ it can be shown that not doing so will lead to an internet meltdown. Since neither you nor Roy attended the San Jose session in February, and (although I supplied them in machine-readable form) the slides I presented there have not been posted as part of the minutes, I will quote from them here: o Cons (real or alleged) [of our proposal] - Slight overhead on the wire * This either pays off, or people won't use it - Some storage overhead - May reduce pressure on service authors to adopt more complex proposals - May not provide enough information to attract wide use o Last two "cons" cannot both be true! To be specific, in this message, you yourself have stated "Many people want web metrics better than what have now, but this draft does not provide such metrics." and "if the draft is adopted, some people who will do cache busting now will switch to the hit counting methods in the draft." You simply can't have it both ways. Either the draft is useful to a significant number of people, or it is not. I wouldn't waste the WG's time discussing proposals about "statistical sampling" until such time as we have seen a specific proposal. Also, adopting the draft may slow the introduction of a better demographics system later. A better system does not necessarily have to be based on HTTP extensions either. However, I have no high hopes of a better system happening very soon, or at all. The social issues that need to be resolved are even more complex than the technical issues. Pending a specific proposal (in the form of an Internet-Draft?) for a "better system", it's pointless to discuss whether adopting our draft would slow the introduction of something better. Especially since you and I seem to agree that a "better system" isn't likely soon. -Jeff
Received on Tuesday, 18 February 1997 15:38:30 UTC