- From: Jeffrey Mogul <mogul@pa.dec.com>
- Date: Tue, 19 May 98 12:01:55 MDT
- To: http working group <http-wg@cuckoo.hpl.hp.com>
Dave Morris writes:
Is anyone aware of any effort to derrive a list of client software
packages which include the "Host:" header with requests? It would
be really helpful for my efforts to push server usage away from
discrete IP addresses to be able to speak with some degree of
authority.
Nice to know what percentages of requests to major net destinations
like Yahoo, Altavista, Infoseek, Excite, Netscape, IBM, Microsoft,
etc. now include the HOST: header.
I thought about asking the folks at AltaVista for this info, but then I
realized that it would probably require reprogramming their server, and
this is something they like to avoid.
But by a strange coincidence, this week one of the other researchers
here yesterday started logging all of the HTTP headers through our
proxy here in Palo Alto. And so I put together an AWK script to figure
out which User-Agents send Host.
By the way, we're not going to release these logs, no matter how nicely
people ask ... so please don't ask. And also don't ask me what
fraction of requests were HTTP/1.1; the logs apparently don't include
that tidbit.
Out of 1963154 requests logged yesterday (May 18), 1946195 (99.1%) had
User-Agent headers. For these requests, I looked for a Host header
(but without checking on its syntactic or semantic validity!), and then
made a list of User-Agent values associated with requests with or
without Host.
In some cases, the same User-Agent was seen with and without the Host
header. I'm not really keen on doing the analysis to figure out what
really happened in all cases, but I looked at one (more or less random)
example; it looks like when that particular browser is invoked by
Quicken, it changes the request headers fairly significantly. I
suspect that in other cases, some intervening proxy (we have several
layers of internal proxies within Digital) either added or removed Host
headers.
In any case, this means that these results should be taken with a large
dose of skepticism, since it appears that one cannot simply assume that
use of a given User-Agent will always result in the delivery of a Host
header to the origin server.
For these lists, I used only the first "word" (whitespace-delimited
string of characters) in the User-Agent header. I tried analyzing a
subset of the log using the entire User-Agent header; it doesn't seem
to add much information, but it slows things down a lot.
Disclaimer: nothing here is meant as a criticism of any User-Agent
implementation, especially since my analysis could be erroneous.
-Jeff
User-agents that were never seen with a Host header:
0101500608win16001
0101600719win16001
0101600719win16014
0101600719win16042
0101600720win16001
0102001290win32001
AVSMCPX
Crescent
Enhanced_Mosaic
FFiNet32.DLL/3.1
Go-Ahead-Got-It/1.1
Lotus
Lynx/2.3
Lynx/2.3-FM
Lynx/2.3.6
Marimba
Mozilla/1.12I
Mozilla/1.2
Mozilla/1.22
Mozilla/3.0b9Gold
NCSA
NSPlayer/2.0
PCNviewer
Proxy
QFNApp/1.0
TSTPAV
Tuner/1.1.1
Tuner/2.0.2
Update
VXtreme,
Visto-Assistant/Commercial-Release-2.0
WebCopy/0.98b7
www.pl/961205.24
User-agents that were always seen with a Host header:
0102001735win32090
0102001737win32001
0102001737win32011
0102001737win32024
0102001790win32001
0102001790win32051
0102001790win32090
0102001792win32001
0102001792win32015
0102001792win32024
0102001792win32042
0102001792win32043
0102001792win32073
0102502226win32001
@%146%01L%146%01%7c%146%01%94%146%01
Alexa
Alexa/1.1.4.0%3bMicrosoft
AlphaCONNECT
BW-C-2.0
CSymWebPage
Caching-Manager/2.1
Conveyer
DMS-NetLink-GetLink
HotJava/1.1
InstallShield
Investor
Java1.0.2
Java1.1
Java1.1.4
LiveUpdate
Lotus-Notes/4.5
Lynx
Lynx/2.5FM
Lynx/2.6
Lynx/2.7.1
Lynx/2.7.2
Lynx/2.8rel.2
MFC_Tear_Sample
MSFrontPage/2.0
MSFrontPageWpp/3.0
MSInvestor
MSN
MSNBC-News-Alert/2.2
MSNBC-News-Browser-IE/2.1
MSWebPostPostInfoProcessor/1.5
Mozilla/1.1
Mozilla/2.01Gold
Mozilla/3.01-C-MACOS8
Mozilla/3.01C-DH397
Mozilla/3.01C-KIT
Mozilla/3.01C-WorldNet
Mozilla/3.03
Mozilla/3.04GoldC
Mozilla/3.0C-NC320
Mozilla/3.0C-WorldNet
Mozilla/4.04j2
NeoPlanet
NetAttache/2.5
Net_Vampire/2.4
OilChange
PhotoImpact
PrimeNet3Win32
RealPlayer
Registration
Scooter/1.1
ServerComm
SunLab's
Teleport
UPDATEIT
WebTrends/3.0
WebZIP/2.0
Wget/1.4.5
contype
d%f0D
libwww-perl/5.20
xmcd/v2.2PL1
User-agents that were sometimes, but not always, seen with a Host header:
GetRight/3.1
Microsoft
Mozilla/2.0
Mozilla/2.01
Mozilla/2.02
Mozilla/2.02Gold
Mozilla/3.0
Mozilla/3.01
Mozilla/3.01C-POLNET
Mozilla/3.01C-SI304A01
Mozilla/3.01Gold
Mozilla/3.02
Mozilla/3.02Gold
Mozilla/3.03Gold
Mozilla/3.04
Mozilla/3.04Gold
Mozilla/3.0Gold
Mozilla/4.0
Mozilla/4.01
Mozilla/4.02
Mozilla/4.03
Mozilla/4.04
Mozilla/4.05
NSPlayer/3.0.0.2437
Tuner/2.1.2
Received on Tuesday, 19 May 1998 12:07:10 UTC