some data for todays meeting

From: Hussein Suleman (hussein@vt.edu)
Date: Fri, Oct 08 1999


Date: Fri, 08 Oct 1999 11:20:44 -0400
From: Hussein Suleman <hussein@vt.edu>
To: www-wca@w3.org
Message-id: <37FE0BCC.2054DC64@vt.edu>
Subject: some data for todays meeting

hi

for today's meeting im going to make reference to the following
fragments of data sets that i ran through the current
validator/cleaner/canonicalizer program ...

(nb. validator because it checks sanity of fields, cleaner because it
eliminates lines with errors, canonicalizer because it generates a
differently formatted output)

1. this is an extract from the one of the boeing log files (which were
anonymized with log2anon)

28756.552       290     1202    18      6356819 1       13429   8993   
200     1
10      2       2
28762.777       202     429     24      3253    1       290     290    
302     1
10      2       3
28763.070       805     14006   66118   2146924 1       1430    1430   
200     1
10      2       3
28763.940       105     242     18      6356820 1       13842   3910   
302     1
10      2       1
28764.606       338     14192   18      17802   1       353     353    
200     1
10      2       3

2. this is the report from the validator

---General Statistics---
Total number of lines       : 256206
Lines with too many fields  : 0
Lines with too few fields   : 0
Lines with errors           : 12943
First 10 lines with errors : 137 139 165 489 506 507 508 509 515 518

---Field Errors---
18 : Response -- Invalid response -- 12943 lines [137 139 165 489 506
507 508 509 515 51
8 ...]

3. this is the canonical file generated in ECLF format

- - - [-/-/-:-:-:- -] "-" 200 1202 "-" "-"
- - - [-/-/-:-:-:- -] "-" 302 429 "-" "-"
- - - [-/-/-:-:-:- -] "-" 200 14006 "-" "-"
- - - [-/-/-:-:-:- -] "-" 302 242 "-" "-"
- - - [-/-/-:-:-:- -] "-" 200 14192 "-" "-"

obviously it doesnt look too good. but thats because of the particular
nature of what was logged in the boeing files as compared to what we are
used to seeing in traditional log files. i will talk more about this
during the meeting.

ttfn

hussein

-- 
=========================================================================
hussein suleman -- hussein@vt.edu -- vt cs --
http://purl.org/net/hussein
=========================================================================