ISSUE-66 ("image analysis heuristics")

http://esw.w3.org/topic/HTML/ChangeProposals/ImageHeuristics

I plan on submitting this Change Proposal within 24 hours due to the 16 Jan deadline. Any comments or edits you can provide between now and then would be greatly appreciated.

Copied here for your convenience:

Summary

4.8.2 The img element includes:

"User agents may also apply image analysis heuristics to help the user make sense of the image when the user is unable to make direct use of the image, e.g. due to a visual disability or because they are using a text terminal with no graphics capabilities."

This text should be stricken, because it gives the message that this is viable now, and a reasonable substitute for @alt. Neither is true.


Rationale

Despite all evidence to the contrary, the current draft suggests that image heuristics algorithms could be used to recover from images with missing @alt. The technology not only does not exist: it cannot exist.

It's possible that Google Goggles was the proposed implementation of image analysis heuristics here, but while it may perform well with well-defined objects with large databases of established imagery, it is not a cure-all for the problem of missing @alt -- and if it were, in fact, that good, it would be better positioned as a tool for the authoring process than in the browser.

The reason for this is that images themselves are only place markers for what the author intends to express. It is the author, then, and not the image, that is most responsible for determining what fits best as alternate content. And for this reason, no automated tool can possibly claim to sufficiently repair missing @alt content. This sentence only serves to make that less clear.

User agents may use any technology they choose to improve the user experience for users with disabilities. Such an implementation may in fact have positive effects, in certain cases. However, it is not necessary to specify this, particularly if by doing so the necessity of human-created @alt is made less than perfectly clear.

A second problem is that image analysis heuristics such as Google Goggles require enormous (as in upper terabytes to lower petabytes) databases of content in order to function at anything better than parlor-trick levels of efficacy. At the moment, the only available provider for such a service is Google. This is, in effect, a binding in the specification to a closed-source, single-provider service, which sets a dangerous precedent.


Details

Remove the paragragh:

"User agents may also apply image analysis heuristics to help the user make sense of the image when the user is unable to make direct use of the image, e.g. due to a visual disability or because they are using a text terminal with no graphics capabilities."


Impact

	• Removing the text will not prevent UAs from implementing such a system, but will protect the importance of @alt from being undermined.

Positive Effects

	• Removing the text will remove the misconception that image analysis can currently be used to recover from images with missing text alternatives.
	• Removing the text will remove the potential creation of an HTML language feature that is only offered by one provider, and is only available in the cloud.

Conformance Classes Changes

	• None

Risks

	• None

Received on Thursday, 14 January 2010 23:09:50 UTC