- From: Karl Dubost <karl@w3.org>
- Date: Thu, 17 Apr 2008 10:34:21 +0900
- To: Ian Hickson <ian@hixie.ch>
- Cc: "public-html@w3.org" <public-html@w3.org>
On 17 Apr 2008, at 07:47, Ian Hickson wrote:
>> * proportion of alt="[text]", alt="", alt=" ", no alt at all.
>
> What would that tell us?

1. alt="text" and alt="" are likely to be something done deliberately by
authors: they took the time to do it. But that gives only partial
information, which is why I mentioned 3. below.

>> * In the set of pages which contains img elements, distributions of
>> number of img elements (how many pages with 1 img, with 2 img, etc.)
>
> What would that tell us?

2. That would give us the proportion of pages which use images, and how
much of an issue it is for the Web. Maybe a better distribution would be

    There are 10% of pages with 3 alt="", 1 alt="text", etc.
    There are 20% of pages with 2 alt="", 0 alt="text", etc.

It is a bit more complex, but it would give us a first rough estimate of
the proportion of possibly decorative elements.

>> More challenging, distributions of "text", collect all the text
>> contained in alts, sort them out, and then sees what are the text which
>> are happening very often (I think about things like "logo" emerging, but
>> there might be surprises).
>
> I'm not sure what you mean by "sort them out". Could you elaborate?

    cat lines.txt | sort | uniq -c | sort -nr

-- 
Karl Dubost - W3C
http://www.w3.org/QA/
Be Strict To Be Cool
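
As a rough illustration of the counts discussed above, here is a minimal Python sketch that sorts the img elements of one page into the four alt categories and tallies the non-blank alt texts by frequency, much as `sort | uniq -c | sort -nr` would. The names `AltCollector`, `classify`, and `survey`, and the sample markup, are assumptions made for this illustration; they do not come from the original message, and a real survey would have to deal with crawling, encodings, and broken markup.

```python
from collections import Counter
from html.parser import HTMLParser


class AltCollector(HTMLParser):
    """Hypothetical helper: records the alt value of every img element.

    None is recorded when the alt attribute is absent.
    """

    def __init__(self):
        super().__init__()
        self.alts = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attr_map = dict(attrs)
        if "alt" in attr_map:
            # html.parser reports a bare `alt` (no value) as None;
            # this sketch treats it like alt="".
            self.alts.append(attr_map["alt"] or "")
        else:
            self.alts.append(None)


def classify(alt):
    """Map one alt value to one of the four categories from the thread."""
    if alt is None:
        return "no alt"
    if alt == "":
        return 'alt=""'
    if alt.strip() == "":
        return 'alt=" "'
    return 'alt="text"'


def survey(html):
    """Return (category counts, alt text counts) for one page."""
    collector = AltCollector()
    collector.feed(html)
    collector.close()
    categories = Counter(classify(alt) for alt in collector.alts)
    # Only non-blank alt texts are tallied, without any normalisation.
    texts = Counter(alt for alt in collector.alts if alt and alt.strip())
    return categories, texts


# Made-up sample page, for illustration only.
sample = (
    '<p><img src="a.png" alt="logo"><img src="b.png" alt="">'
    '<img src="c.png" alt=" "><img src="d.png">'
    '<img src="e.png" alt="logo"></p>'
)

categories, texts = survey(sample)

# Most frequent alt texts first, in the spirit of `sort -nr`.
for text, count in texts.most_common():
    print(count, text)
```

Running `survey` over every page in a sample and adding up the per-page `categories` counters would give the overall proportions, while keeping them per page would give the "pages with 3 alt="", 1 alt="text"" style of distribution.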
Received on Thursday, 17 April 2008 01:34:56 UTC