- From: Charles McCathieNevile <chaals@opera.com>
- Date: Sat, 07 May 2011 13:18:17 +0200
- To: "Gregory J. Rosmaita" <oedipus@hicom.net>, "Benjamin Hawkes-Lewis" <bhawkeslewis@googlemail.com>
- Cc: "Laura Carlson" <laura.lee.carlson@gmail.com>, "HTML Accessibility Task Force" <public-html-a11y@w3.org>
On Fri, 06 May 2011 22:30:23 +0200, Benjamin Hawkes-Lewis <bhawkeslewis@googlemail.com> wrote: > On Fri, May 6, 2011 at 4:37 PM, Gregory J. Rosmaita <oedipus@hicom.net> > wrote: >> no, i am NOT ok with dropping the Hidden Metadata Fallacy -- this is >> a fundamental philosophical/logical principle: that discoverable >> metadata is available for those who NEED it > > Nobody disputes that users needs text alternatives, and HTML5 must > provide mechanisms for authors to provide them. > > The hidden metadata objection simply says that encouraging authors to > hide such text alternatives makes text alternatives more likely to be > poor quality. I accept that assertion as true. I also accept, as the chairs found, that it isn't very important in determining whether it should be possible to use hidden metadata... > Here's Tantek's objection from the poll: > > "[@longdesc] is one of the worst forms of invisible metadata or "dark > data" which are known to rot and become inaccurate over time (see: meta > keywords, RDF in comments, sidefiles, etc.)." Indeed. Because it is an edge case - while the web is filled with icons, pictures of text and the like, the marginal value of longdesc for most of the web's images does not justify the cost of providing a substantial description (as opposed to functional equivalent like alt, or textual role information like title). However, for those images where the cost is justified, and where longdesc is used to provide dark metadata (it can equally be used to link to content present in the page, if the author chooses) the cost of "dark metadata" is often justified - and it appears that many of the examples are in fact relatively high-value material with a relatively large investment in maintaining the data systems accuractely. No mechanism should force two-bit part-time amateurs to build expensive and complex high-value systems to publish a photo of their cat. But the web is used a lot by organisations and individuals who do put a high value on their content, manage it with a significant investment, and for whom the "dark metadata" argument is about as relevant as saving bottle-tops to recycle the metal as a source of income is to people like Tantek and Lachlan. > Here's Lachlan's objection from poll: > > "There are arguments based on the idea that hidden metadata is useful > for some unspecified use case, Actually, the use cases are specified. > with the supposed problem that any additional information that may be > referenced by longdesc, must be hidden from the vast majority of > users, and not otherwise affect the design of the page. This is not a supposed problem. It is an observed fact that some relevant proportion of thorough image descriptions will not be included in the content of a page. It is also a well-understood principle of usability and accessibility that adding more content to a page can easily *decrease* its accessibility for a large number of users. > "Such arguments are misguided, because > it is far better for accessibility for such issues to be considered a > prominent part of the page's design and/or user experience from the > outset, That it is better for accessibility to be considered from the start is not, I believe, in dispute. However, ignoring the above-mentioned principle about not overloading the user with content means this argument is fundamentally flawed - it would claim to justify large amunts of content on pages which are readily shown to have reduced accessibility because of that content. > and for the accessible content to be treated as a first class citizen > of the site. This argument assumes that first-class citizens are only those who are visible. It assumes that there is no management of content, rather than that sometimes the management is so minimal that anything not visible is forgotten. The reality of the Web is that many resources are managed far more carefully (although many are not - there's a lot of rubbish published too). In particular longdesc (which is relatively expensive) is used more by those who apply more management. > Hiding it behind the longdesc attribute, or any other similar method, > effectively treats it as a second or even third class citizen This is an unsubstantiated allegation about valuation of resources, used as a rhetorical device. It has no real bearing on the argument. > which has been clearly demonstrated to result in suboptimal alternative > content I accept this statement, although the question of degree is highly relevant in judging the importance of the argument. It is known that accessibility of the web is very "sub-optimal" (woeful, on average, is probably a reasonable description - it's certainly what Hixie suggested as his experience of using JAWS) and yet it is very clearly appreciated by and valuable to many people who rely on such accessibility as there is. > that never actually helps those it's intended to." This is again a sweeping and unsubstantiated generalisation. It relies on the assumption that anything less than perfect is terrible, which is logical fallacy and utter nonsense. > http://www.w3.org/2002/09/wbs/40318/issue-30-objection-poll/results#xfigure > > In the Chairs' decision, they accepted that the tendency of invisible > metadata to rot could help "explain the bad data that has been seen", > but treated this as a weak objection because only anecdotal evidence > was provided that rotten data would cause implementors or users to give > up on @longdesc and no evidence was provided that user agents could not > exclude a lot of bad data. > > As such, I would expect a "hidden metadata fallacy" section to put > forward arguments in favour of any of the following: > > * Errors in visible data and hidden metadata are equally likely to be > corrected. (This seems obviously false.) While "equal probability" is false, it is important to recognise that where information is managed (good libraries, well-run corporations, effective governments, organised and thoughtful individuals, community-supervised resources, etc etc) this argument becomes extremely weak, and its relevance rests on the fallacy Lachy assumed above that anything less than perfect is more harmful than helpful. > * In the special case of long text alternatives, authoring visible > data is likely to compromise quality more than hiding it because of the > risk that authors write captions that assume you can see and make sense > of the image. I would express the argument in terms of the difficulty of clearly describing an image in text that fits the flow of a page in which it appears. I would further note the point that overloading a page with information is shown not to increase, but to decrease its accessibility for many people. > * More authors will provide text alternatives if they can hide them, > and this is worth the accuracy cost of helping them to do so. Indeed. > * User agents could excude bad data. User agents can (trivially easily - it takes much less time than has been spent on this current thread) take the most common form of bad data - a description instead of a URL in the attribute value, and make it available to the user. It is also trivially easy to find this fault in validation tools, authoring environments, content management systems, etc. > * Users will not give up on @longdesc if they repeatedly encounter bad > @longdesc values. > > * User agents will not stop implementing @longdesc if users repeatedly > encounter bad @longdesc values. These arguments rest on either common sense (which I don't trust), or significant amounts of study with large numbers of users and clear discussions with user agent developers. My sense is that the first will prove true in line with my experience, and the second will depend in large part on the outcome of this issue in the HTML WG. > * Better implementations will make it easier to discover @longdesc > visually, reducing the error rate due to it being hidden from the > normal rendering of the page itself. This follows as a necessary consequence of the hidden metadata argument itself. > But instead, the "Hidden Metadata Fallacy" section lays seige to a > nonsenical strawman argument asserting that hidden metadata is not > discoverable at all. > >> i, for one, think there is a crucial distinction between "hidden >> metadata" and "discoverable metadata" > > What's the distinction? Actually, hidden is not a binary state, it depends on usage. This subtlety seems to have been lost to the "anti-longdescers", and is the principle which Gregory is trying to elucidate. I will try to propose a new text for the section tomorrow which clarifies these arguments. >> -- one could call "hidden" any metadata attached to an image (such as a >> JPEG file) which contains info about the camera, the resolution, and >> other technical metadata because when the image is rendered, such >> metadata is not immediately available to the user unless that metadata >> is extracted or exposed using a specialized tool -- this is the entire >> point of such initiatives as the RDF in Photo work within the W3C and >> some of the other initiatives that laura has documented... > > In the context of a webpage, those are all hidden metadata. No, some of them are presented (e.g. Flickr extracts them sometimes). But I don't think the argument is very strong (although it is valuable enough to explain), because those metadata are automatically generatable, and while they are indeed subject to quality degradation (e.g. a photo manipulation tool forgets to update them as approriate) they are relatively easily maintained. Longdesc, by virtue of being difficult or impossible to generate by machine in most instances, is vulnerable to quality degradation whenever it is not getting attention from actual humans. There is lots of work on ways to support quality maintainance automatically, and it isn't all complicated. But it is a qualitatively different job. Cheers Chaals -- Charles McCathieNevile Opera Software, Standards Group je parle français -- hablo español -- jeg lærer norsk http://my.opera.com/chaals Try Opera: http://www.opera.com
Received on Saturday, 7 May 2011 11:19:03 UTC