W3C home > Mailing lists > Public > public-html@w3.org > April 2008

Re: alt and authoring practices

From: Karl Dubost <karl@w3.org>
Date: Thu, 17 Apr 2008 10:34:21 +0900
Cc: "public-html@w3.org" <public-html@w3.org>
Message-Id: <55ECAC35-4D9F-4C0E-AB5F-BD2AAE4F1D55@w3.org>
To: Ian Hickson <ian@hixie.ch>


Le 17 avr. 2008 à 07:47, Ian Hickson a écrit :
>> * proportion of alt="[text]", alt="", alt=" ", no alt at all.
>
> What would that tell us?

1. alt="text", and alt="" is likely to be something done by authors.  
They took the time to do it. But gives only a partial information,  
it's why I mentionned 3. below.




>> * In the set of pages which contains img elements, distributions of
>> number of img elements (how many pages with 1 img, with 2 img, etc.)
>
> What would that tell us?

2. That would gives us the proportion of pages which are using images  
and how much of an issue it is for the Web. Maybe a better  
distribution would be

There are 10% of pages with 3 alt="", 1 alt="text", etc.
There are 20% of pages with 2 alt="", 0 alt="text", etc.

but it is a bit more complex, but that would give us a first rough  
estimate of proportions of possible decorative elements.



>> More challenging, distributions of "text", collect all the text
>> contained in alts, sort them out, and then sees what are the text  
>> which
>> are happening very often (I think about things like "logo"  
>> emerging, but
>> there might be surprises).
>
> I'm not sure what you mean by "sort them out". Could you elaborate?


cat lines.txt | sort | uniq -c | sort -nr


-- 
Karl Dubost - W3C
http://www.w3.org/QA/
Be Strict To Be Cool
Received on Thursday, 17 April 2008 01:34:56 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 9 May 2012 00:16:14 GMT