- From: Ian Hickson <ian@hixie.ch>
- Date: Fri, 22 Aug 2008 22:45:24 +0000 (UTC)
- To: Steven Faulkner <faulkner.steve@gmail.com>
- Cc: public-html@w3.org, W3C WAI-XTECH <wai-xtech@w3.org>
On Thu, 21 Aug 2008, Steven Faulkner wrote: > > > >There are two problems with this. One is that the descriptive > >identification of the non-text content would almost certainly be > >provided anyway, e.g. as the image caption, for all users, and thus > >including it in the alt="" attribute would be redundant, leading to > >"stuttering" (that is, content repetition). > > Can you provide the the research to back up the claim that descriptive > identification would almost certainly be provided elsewhere? Sure. Using Google Sets to come up with some photo upload sites: http://labs.google.com/sets?hl=en&q1=flickr&q2=picasa&q3=photobucket&q4=imageshack&q5=&btn=Large+Set ...I visited a bunch of photo pages, by picking a cute image from the front of each site's front page: Flickr http://www.flickr.com/photos/moonflower5/2788138892/ Smugmug http://cmac.smugmug.com/gallery/5363890_2awbk/#334290495_zqEkV Photobucket http://media.photobucket.com/image/motivational%20posters%20or%20motivation/RukiTemaFan21/Motivational%20Posters/chibi.jpg?o=98 Picasa Web Albums http://picasaweb.google.com/lescagoules/IMELOOS05072006?feat=featured#5228949483608819442 Zooomr http://www.zooomr.com/photos/reveiled/2931890/ Fotki http://public.fotki.com/dwbrant/pets/7-21-2005/img_2890.html webshots http://outdoors.webshots.com/photo/2849914900101838694PfFJEB Share on Ovi http://share.ovi.com/media/PangeaDay.film/CHAPLIN.10150 Fotolog http://www.fotolog.com/emilylovesponcho glowfoto http://www.glowfoto.com/random.php zoto http://www.zoto.com/site/#USR.vapileh::PAG.detail::c14394d0e212c26fcbd2bba81f1f31d1 In all but one case, the photo had descriptive information elsewhere on the page. The exception was glowfoto, where it appears images don't have any descriptive information at all other than the user's name (which is on the page) and the category or album name (which is not on the page in the case of the random.php script, but which appears when you look at photos on a per-photo basis instead of randomly). > The caption for such an image does not necessarily suffice as the text > alternative Indeed, it almost never does. > much more descriptive information can be provided for disabled users who > cannot view the image or have a vision impairment that renders image > blurred. Indeed. That's required by HTML5 wherever possible. > Who are you to decide that this information is not worthy of inclusion > by mandating that the alt must be of the form {}. The information is worthy of inclusion. It _must_ be included if at all possible. The {...} form (or the omission of alt="" altogether, if we go back to that) is only allowed in the case where the tool generating the HTML simply doesn't have alternative text available (as, for instance, in all the cases listed above). > In the example below the alt provides descriptive information that may > be useful to visually impaired users. It provides information that would > be obvious to a sighted user but could not be ascertained by a vison > impaired user unless they asked someone to describe it to them. That is > the point of a textual alternative, to provide information to a disabled > user that cannot access the information themselves. > > <figure> > <img src="/commons/a/a7/Rorschach1.jpg" alt="a black vertically > symmetrical shape, which contains 4 small white areas and 9 smaller > black blotches seperated from the main body of the shape."> > <legend>A black outline of the first of the ten cards > in the Rorschach inkblot test.</legend> > </figure> Is that really what you see when you look at Rorschach1? Personally I see a butterfly with moth-eaten wings, and two little claw hands around the head. Of course, the whole point of a Rorschach test is that describing it supposedly shows insight into your personality, so the very act of writing alternative text for a Rorschach inkblot is a bit like collapsing a wavefunction. > > Could you give an example of such a page where one would have an image > > that cannot be described when the page is written but where the image > > nonetheless has enough associated data that the user would get > > confused? > > examples: > http://www.flickr.com/photos/ > http://www.flickr.com/photos/tags/australia/ > http://www.flickr.com/explore/interesting/7days/ I couldn't find any images on those pages that, per the current HTML5 text, are allowed to have the alt={...} text. (They are all inside images and therefore all require some useable alternative text.) In fact in all those cases, one wouldn't _need_ to come up with descriptive alternative text. On the first for example, the image caption or author, or some of the image's tags, works fine as alternative text, since the image is merely a thumbnail that leads to the image itself. > What josh provided was one set of videos, this in way way can be > considered a basis for any decisions, the study is in no way > 'scientific' or objective It's a whole heck of a lot more useful than nothing, which is what we have backing up the "always require alt even if it means the site has to chose between compliance and going out of business" arguments that have been put forward here. > > That is why it is more important to base our decisions on actual > > objective research. > > All the decisions appear to be made by you, so the use of 'our' seems > incorrect here. where is this 'objective research'?? Well, the tentative decision to use {...} was based on this research: * it's objectively verified that there are pages that have images that have no alternative text available where the generators of the HTML are not able to obtain that data. See, for example, all the pages listed at the top of this e-mail. * it's objectively verified that requiring alt="" attributes does not lead to image sharing sites requesting alternative text from their users. Evidence: HTML4 requires alt="" attributes, Flickr doesn't require users to enter alternative text. * it's objectively verified that such sites are major sites and are an important part of the Web ecosystem. For example, photobucket is #26 on the Alexa top 500, Flickr is 39: http://www.alexa.com/site/ds/top_sites?ts_mode=global&lang=none (Other ranking sites lead to similar conclusions.) These three points lead to this decision: => We should allow sites to be written that include images from users even if the sites do not have suitable replacement text. Now, to design the mechanism to allow this, several ideas were considered: * Omitting alt="" altogether for these cases. * Having a special syntax inside alt="" for these cases. * Having a new attribute for these cases. To decide between these, Philip and I both did studies examining alt="" values. Some of the research is described here (sorry for not making it available in more convenient forms): * http://philip.html5.org/data/alt-in-braces.txt * http://damowmow.com/temp/alt-in-braces.txt * http://krijnhoetmer.nl/irc-logs/whatwg/20080603#l-65 * http://krijnhoetmer.nl/irc-logs/whatwg/20080802#l-142 * http://krijnhoetmer.nl/irc-logs/whatwg/20080803#l-3 It indicates the alt={...} is pretty rare today. It also indicates that alt="" being omitted is pretty common today too (despite HTML4 requiring it, though that's another story). The following arguments impacted the decision, though they have nothing beyond logic, anecdotal evidence, or conclusions derived from unpublished studies to back them up: * Introducing attributes for features that are supposed to be an indicator of a problem (lack of alt text, in this case) isn't good language design, as it brings it too much into prominence. * An attribute would almost certainly be copied around unintentionally by authors leading to it being at least as unreliable as the special syntax if not more. * An attribute introduces a whole class of extra conformance errors and complications, such as what to do when it is used with or without the alt="" attribute. All the above led to what the spec says today, which is the alt={...} idea. Since the above, however, it has been pointed out that one problem with the alt="{...}" idea exists that wasn't previously considered, namely that it makes it harder for systems that _are_ trying to automate alternative text creation to do so, since they now have to avoid accidentally using the {...} syntax. This dramatically lowers the attractiveness of the {...} idea, leaving pretty much only omitting alt="" for these cases as a least-worse option. -- Ian Hickson U+1047E )\._.,--....,'``. fL http://ln.hixie.ch/ U+263A /, _.. \ _\ ;`._ ,. Things that are impossible just take longer. `._.-(,_..'--(,_..'`-.;.'
Received on Friday, 22 August 2008 22:45:49 UTC