Copyright © 2003 W3C ® ( MIT , ERCIM , Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply.
NOTICE:
This draft includes a number or editorial changes, recent proposals and an additional level of conformance.
W3C published the Web Content Accessibility Guidelines 1.0 (WCAG 1.0) as a Recommendation in May 1999. This Working Draft for version 2.0 builds on WCAG 1.0. It has the same aim: to explain how to make Web content accessible to people with disabilities and to define target levels of accessibility. Incorporating feedback on WCAG 1.0, this Working Draft of version 2.0 focuses on checkpoints. It attempts to apply checkpoints to a wider range of technologies and to use wording that may be understood by a more varied audience.
This document is an editors' copy that has no official standing.
This document is prepared by the Web Content Accessibility Guidelines Working Group (WCAG WG) to show how more generalized (less HTML-specific) WCAG checkpoints might read. This draft is not yet based on consensus of the WCAG Working Group nor has it gone through W3C process. This Working Draft in no way supersedes WCAG 1.0.
Please refer to "Issue Tracking for WCAG 2.0 Working Draft" for a list of open issues related to this Working Draft. The "History of Changes to WCAG 2.0 Working Drafts" is also available.
This is a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress". A list of current W3C Recommendations and other technical documents is available.
The Working Group welcomes comments on this document at public-comments-wcag20@w3.org. The archives for this list are publicly available. Archives of the WCAG WG mailing list discussions are also publicly available.
Patent disclosures relevant to this specification may be found on the WCAG Working Group's patent disclosure page in conformance with W3C policy.
This document has been produced as part of the W3C Web Accessibility Initiative (WAI). The goals of the WCAG WG are discussed in the Working Group charter. The WCAG WG is part of the WAI Technical Activity.
A text equivalent
serves the same function as the non-text content was intended to serve.
communicates the same information as the non-text content was intended to convey.
may contain structured content or metadata.
Note:
Text-equivalents should be easily convertible to braille or speech, displayed in a larger font or different colors, fed to language translators or abstracting software, etc.
non-text content includes but is not limited to images, text in raster images, image map regions, animations (e.g., animated GIFs), ASCII art, images used as list bullets, spacers, graphical buttons, sounds (played with or without user interaction), stand-alone audio files, audio tracks of video, and video.
Note:
Scripts, applets, and programmatic objects are not covered in this definition and are addressed in checkpoint 4.3.[I#333]
content that can be expressed accurately and unambiguously in a reasonable number of words (for example, diagrams, charts, illustrations, etc.) Content such as a musical performance or visual artwork is considered "content that can not be expressed in words," since this type of content relies heavily on the visual (or auditory) experience.[I#320]
Individuals who are blind, have low vision, have cognitive disabilities or have trouble reading text for any reason can have the text read aloud to them.
Individuals who are deaf, are hard of hearing or who are having trouble understanding the audio information for any reason can read the text presentation or have it translated and presented as sign language by their assistive technology.
Individuals who are blind or deaf-blind can have the information presented in braille.
Example 1: an image used as a button. (short description of function)
A right arrow icon is used to link to the next slide in a slide show. The text equivalent is "Next Slide," so that what is read by a screen reader would be "link: Next Slide."
Example 2: a data chart. (short label + longer description)
A bar chart compares how many widgets were sold in June, July, and August. The short label says, "Figure one - Sales in June, July and August." The longer description identifies the type of chart or graph, provides a high-level summary of the data comparable to that available from the chart or graph, and lists the data themselves.
Example 3: an animation. (short label + longer description)
An animation shows how to tie a knot. The short label says, "An animation showing how to tie a square knot." The longer explanation describes the hand movements needed to tie the knot.
Example 4: an audio file of a speech. (short label + transcript)
An audio file is embedded in a Web page. The short label says, "Chairman's speech to the assembly." A link to a text transcript is provided immediately after the clip.
Example 5: an audio file of a symphony. (short label)
An audio file is embedded in a Web page. The short label says, "Beethoven's 5th Symphony performed by the Vienna Philharmonic Orchestra."
Editorial Note (06/10/03): There is discussion about moving some of the current success criteria from Required to Best Practice or to an Extended checkpoint. The issue stems from trying to apply the success criteria to every Web cam, newscast, and home broadcast. Another approach is to allow a conformance claim to state, for example, "All pages and applications on this site meet the Core checkpoints of WCAG 2.0 except the Web cam at http://example.org/webcam/."
a text document that merges all audio descriptions and captions into a collated script (that provides dialog, important sounds and important visual information in a single text document) is provided.
captions and audio descriptions are provided for all live broadcasts which provide the same information. [I#335]
the presentation does not require the user to read captions and the visual presentation simultaneously in order to understand the content.
A time-dependent presentation is a presentation that
is composed of synchronized audio and visual tracks (e.g., a movie), OR
requires the user to respond interactively at specific times in the presentation.
Media equivalents present essential audio information visually (captions) and essential video information auditorily (audio descriptions).
[Definition: captions are text equivalents of auditory information from speech, sound effects, and ambient sounds that are synchronized with the multimedia presentation.]
[Definition: audio descriptions are equivalents of visual information from actions, body language, graphics, and scene changes that are voiced (either by a human or a speech synthesizer) and synchronized with the multimedia presentation.]
People who are deaf or have a hearing loss can access the auditory information through the captions.
People who are blind or have low vision as well as those with cognitive disabilities who have difficulty interpreting visually what is happening benefit from the audio descriptions of the visual information.
People without disabilities also benefit from the media equivalents.
People in noisy environments or with muted sound often use captions.
Captions are used by many to develop language and reading skills.
Audio descriptions also provide visual information for people who are temporarily looking away from the video presentation such as when following an instructional video and looking at their hands.
Captions and text descriptions can also be used to index and search media files.
Note:
Time-dependent presentations requiring people to use a single sense to follow two or more things at the same time may present significant barriers to some users. Depending on the nature of the presentation, it may be possible to avoid scenarios where, for example, a deaf user would be required to watch an action on the screen and read the captions at the same time. However, this may not be available for live broadcasts (e.g. a football game). Where possible (especially for education and training materials), provide content so that it does not require tracking multiple simultaneous events with the same sense, or, give the user the ability to freeze the video so that captions can be read without missing the video.
Example 1: a movie clip with audio description and captions.
A clip from a movie is published on a Web site. In the clip, a child is trying to lure a puppy to the child's bedroom by laying a trail of crumbs. The child mumbles inaudibly to himself as he lays the trail. When not watching the video, it is not obvious that he is laying a trail of crumbs since all you hear is the mumbling. The audio description that is interspersed with the child's mumbling says "Charlie lays a crumb on each stair leading to his room." The caption that appears as he mumbles is, "[inaudible mumbling]."
Example 2: a video clip of a news story.
A video clip accompanies a news story about the recent flooding in a major city. The reporter describes what is seen, for everyone. No audio description is necessary. The captions display what the reporter is saying.
Example 3: a silent animation.
An animation shows a pantomime climbing a ladder. There is no audio track for this animation. No captions or audio description are required. Instead, a text equivalent is provided as described in checkpoint 1.1.
Content is the information or meaning and function.[I#339]
Presentation is the rendering of the content and structure in a form that can be sensed by the user.
Functionality is the function or purpose of the content. The functionality or purpose of content can include presentation of information, as well as any other functionality such as data collection, securing a response from the user, providing user experience, testing, confirmation, purchasing, etc.[I#338]
Structure includes both hierarchical structure of the content and non-hierarchical relationships such as cross-references, or the correspondence between header and data cells in a table.
Separating content and structure from presentation allows Web pages to be presented differently to meet the needs and constraints of different users without losing any of the information or structure. For example, information can be presented via speech or braille (text) that was originally intended to be presented visually.
It can also facilitate automatic emphasis of structure or more efficient navigation.[I#340]
All of these can benefit people with cognitive, physical, hearing, and visual disabilities.[I#340]
Example 1: a multi-column document.
A document is marked up with headings, paragraphs and other structural features. It is presented visually in three columns. The markup that creates the columns is separate from the markup that specifies the logical structure of the document.
Example 2: a scrolling list of stock prices.
Current stock quotes are scrolled horizontally across the screen. The data are separate from the methods used to scroll the text across the page.
Example 3: a 3-dimensional site map.
A custom user interface renders 3D visualizations of the pages on a site and how they relate to one another from a data source. Any hierarchical relationships, groupings, cross-references, etc. would originate in the data source so that alternate interfaces could be rendered (from the same source) that expose the structure of the site in an accessible form. (See also checkpoint 4.3)
Example 4: a list that allows users to sort information on a page according to preference.
A script allows a user to rearrange a categorical listing of music files by date, artist, genre, or file size. The script updates both the structure and the presentation accordingly when generating alternate views.
Facilitating unambiguous decoding of characters and words in content is also helpful for individuals who are learning to read or learning a second language.
Example 2: an acronym in a page title.
In the following heading, "People of the W3C." the acronym "W3C" is marked as an acronym. Because it has been marked appropriately, the user agent would be able to speak the letters of the acronym one at a time rather than attempting to pronounce it as though it were a word.
alternate presentation formats are provided for displaying pages [I#343]
for visual presentations, use font variations, styles, size and white space to emphasize structure.
use color and graphics to emphasize structure.
for auditory presentations, use different voice characteristics and/sounds for major headings, sections and other structural elements.
if content is targeted for a specific user group and the presentation of the structured content is not salient enough to meet the needs of your audience, use additional graphics, colors, sounds, and other aspects of presentation to emphasize the structure.
provide a table of contents or navigation map of the document.[I#342]
Presentation that emphasizes structure:
enables users with cognitive and visual disabilities to orient themselves within the content,
enables all users to move quickly through the content and notice major content divisions
enables all users, but particularly users with visual or cognitive disabilities to focus on important content,
enables all users, but particularly users with visual or cognitive disabilities to distinguish the different types of content.
Example 1: documentation for a product.
Identifying chapters in the structure of a book is appropriate and accepted use of labeling the structure. Within the chapters, headings identify (label) changes in context and highlight ideas contained in the following text. Subtle differences between the appearance of the chapter title and the section headings helps the user understand the hierarchy and relationship between the title and headings. The only difference might be font size and margin indentation when presented visually, and spoken in a difference voice or preceded by a sound when presented auditorily.
Example 2: a data table.
Groups of rows or columns are labeled with headers.
Example 3: an audio presentation.
An audio rendering of a document, generated according to a style sheet, uses a different, more formal voice to read titles and headers so the listener can easily identify the words as a title and not part of the running text.
Individuals with low vision can easily make out characters in the content even if they don't have the wide field of view used by fully sighted persons to separate text from background images.
Individuals with hearing impairments that limit their ability to hear all of the frequencies of speech can make out the words from the sounds they can hear because they are not mixed with residual sounds from the music.
Example 1: a background image on a page.
A background image and text are arranged so that there is no image behind the text or the image is so faint that the difference between the darkest part of the image and the text (which is dark) meets the standard foreground/background contrast requirements. The image behind the text also does not contain lines that are about the same width as the characters so they do not interfere with character recognition.
Example 2: speech over background sounds.
Because speech is often naturally mixed with background sounds (movies, live news etc) and cannot be easily removed or separated, captions are provided (under checkpoint 1.2) to make dialog understandable. However not all people can see or read the captions. Where speech is mixed or recorded so that it is at least 20 db above any background sounds people do not need to rely on captions to understand the dialog.
The term operable includes the concept of efficiency. That is, it implies that the device can be operated from the keyboard in a reasonably efficient fashion. Using MouseKeys or having to tab dozens of times to move through a small section of a document or page, or other unreasonably inefficient keyboard access would not qualify. If a document has a very large number of links, some mechanism other than tabbing through them one at a time needs to be provided. This might include provision of headers (for header navigation), the use of skip navigation, links, etc.[I#346]
A keyboard interface is the point where the application accepts any input that would come from the keyboard (or optional keyboard).
Individuals who are blind (and cannot use pointing devices) can have access to the functionality of the Web content or site.
Individuals with severe physical disabilities can use speech input (which simulates keystrokes) to both enter data and operate the interface elements on the page.
Example 1: operation with multiple input devices.
The content relies only on focus-in, focus-out, and activation events; these are defined in the API of the environment for which the content is written, and are intended to be operable by a variety of input devices, including pointing devices, keyboards and speech input systems.
Example 2: examples of Web content that would and would not be operable from a keyboard or keyboard interface
If it's written to be operable from a computer keyboard, it conforms. (because it is operable from the keyboard.)
If it's written to be used on a device that doesn't usually have a keyboard such as a cell phone and but it can be controlled by an optional keyboard for that device, it conforms. (A person who needs a keyboard - or alternate keyboard - can use it to control the application.)
If it's written to be used with a device that doesn't have a keyboard, but it could also be used by similar devices that do and it would work with their keyboard, it conforms. (A person who needs a keyboard would not buy the device without the keyboard. That device may itself not be considered accessible. But the content can be controlled from a device with a keyboard and therefore conforms to this checkpoint.)
If it's written to work with devices that do not have keyboards and it can not be used by any other devices that do have keyboards, then it does not conform. (It cannot be accessed via keyboard.)
Real-time events are those that are based on the occurrence of events in real-time where the events are not under the control of the author.
A competitive activity is an activity where timing is an essential part of the design of the activity. Removal of the time element would change the performance of the participants. Versions of the activity (e.g. test) that have no time basis or time limits might be preferred and may be required for some venues but this would require a complete redesign of the activity (e.g. test) and may change the character and validation methodology and would therefore not fall under these guidelines.
People with reading disabilities, cognitive disabilities, and learning disabilities often need more time than most people to read and comprehend written text. People with physical disabilities might not be able to move quickly or accurately enough to interact with moving objects.
Content that is updated often might not be processed and read in time or in the proper order by an assistive technology or voice browser.
Examples of content that requires comprehension or a response within a timed interval[I#348] :
automatic refresh
redirection
blinking or scrolling text
dialog that disappears after a short period
shutdown or deactivation of page if activity is not received in a set amount of time
Example 1: blinking text.
Client-side scripting is used to create blinking text. The user can deactivate the use of scripting in his or her browser or override the use of scripts with a user style sheet.
Example 2: a news site that is updated regularly.
A news site causes its front page to be updated every 1/2 hour. The front page contains minimal text and primarily consists of links to content. A user who does not wish the page to update selects a checkbox. The checkbox is in the "user preferences" portion of the site which is one of the first links on each page.
Editorial Note (06/10/03): This Checkpoint is currently included in the Core set of Checkpoints because the WCAG WG expects that it will be possible to test content for flicker and the result will be a flicker rate in Hz that can be stored in a machine-readable format. The WG also expects that a user could, in the future, be able to configure a client to only render content that flickers at "a rate greater than X" by which the client could compare the content metadata and the user's preference. If the assumption regarding a testing tool does not hold at time of final review of these guidelines, this checkpoint will be moved to the Extended set of Checkpoints."
Individuals with photosensitive epilepsy can have seizures triggered by flickering or flashing in the 3 to 49 flashes per second (Hertz) range with a peak sensitivity at 20 flashes per second.
Individuals who are easily distracted may not be able to focus on page content with flicker occurring in the same visual field.
where possible, logical tab order has been created.[I#319]
Editorial Note: One of the reasons for combining these two (old 3.1 and 3.2) is that they both get at the same basic issue. Also, on many sites, it is becoming increasingly difficult to tell when you are navigating within a site and when you are navigating within a document. This will only increase over time. Since the title of this thing is Web content, it is recommended that these two items be combined so that we are talking about Web content versus separating content from sites.
The structure of content represents changes in context. For example,
A book is divided into chapters, paragraphs, lists, etc. Chapter titles help the reader anticipate the meaning of the following paragraphs. Lists clearly indicate separate, yet related ideas. All of these divisions help the reader anticipate changes in context.
A bicycle is divided into wheels and a frame. Further, a wheel is divided into a tire and a rim. In an image of the bicycle, one group of circles and lines becomes "wheel" while another group becomes "frame."
A site navigation mechanism is a mechanism for easily orienting and moving about within the site. Site navigation mechanisms include but are not limited to:
A home page with hyperlinks on it and subsequent pages that link to the other pages at the site
site map(s)
search engine(s)
expanding outline(s)
dynamic fisheye views showing all linked pages or topics related to any page.
3-D virtual representations of site content
When the logical structure is provided in markup or a data model,
Users with physical disabilities can use structure to more easily jump between paragraphs, chapters, sections etc.
Users with cognitive disabilities can use structure (chapter titles, headers, etc.) to provide more context for the text that follows them. They also provide warning of a change in context and reorient the user to the new focus.
Users with blindness or low vision can jump from header to header to get an overview or to more quickly "skim" to the section they are interested in.
Readers with low vision can sometimes (depending on display technology) change how chapter titles and headers are displayed to make them more visible -and easier to use when skimming the document.
the content can be presented on a variety of devices because the device software can choose only those elements of the content that it is able to display and display them in the most effective way for that device.
Providing different navigation mechanisms can provide a better match between different people's skills, background knowledge, visual vs. text orientation, and the type of information they are seeking at the moment.
Individuals with cognitive disabilities may find it easier to ask for what they want than to deduce its location from categorical choices.
Individuals with low vision or blindness may find search techniques that fetch everything that relates to a topic of interest to be easier than techniques that require them to scan lists or pages for the items.
Example 1: a physics dissertation.
A dissertation contains well-defined sections such as "Abstract," "Table of Contents," "Chapter 1," etc. The pieces in each section (paragraphs, subheadings, quotes) are denoted with structural markup.
Example 2: a scalable image of a bicycle.
Lines and a circle (spokes and rim) are grouped into a "wheel." Lines in a triangle that attach to each wheel are grouped into a "frame."
Example 3: user interface.
User interface controls are divided into organized groups.
Natural languages are those used by humans to communicate, including spoken, written, and signed languages.
Phrases from various languages, acronyms and abbreviations are often interspersed in writing. When these phrases are identified, a speech synthesizer can voice text with the appropriate accent and pronunciation. When they are not identified, the speech synthesizer will use the default accent and pronunciation of the language on the rest of the page, which can make the phrase unintelligible. Identifying changes in language and marking abbreviations and acronyms as such will also allow a tool to ask for automatic translations of that content. When editing content, authoring tools can switch between appropriate spelling dictionaries.
Example 1: a French phrase in an English sentence.
In the following sentence, "And with a certain je ne sais quoi, she entered both the room, and his life, forever." the French phrase "je ne sais quoi" is marked as French. Depending on the markup language, English may either be marked as the language for the entire document except where specified, or marked at the paragraph level.
the content has been reviewed, taking into account the following strategies for determining the definition of abbreviations and acronyms, applying them as appropriate.
provide a definition or link (with the first occurrence) of phrases, words, acronyms, and abbreviations specific to a particular community.
provide a summary for relationships that may not be obvious from analyzing the structure of a table but that may be apparent in a visual rendering of the table.
if contracted forms of words are used such that they are ambiguous, provide semantic markup to make words unique and interpretable.
Content is considered complex if the relationships between pieces of information are not easy to figure out. If the presentation of the information is intended to highlight trends or relationships between concepts, these should be explicitly stated in the summary.
Examples of complex information:
data tables,
concepts that are esoteric or difficult to understand,
content that involves several layers.
Content might be unfamiliar if you are using terms specific to a particular community. For example, many of the terms used in this document are specific to the disability community.
Summarizing information that is difficult to understand helps people who do not read well.
Providing a summary of the visual cues that show relationships between complex information helps people who do not use visual cues or who have difficulty using visual cues. For example, people who are completely blind do not use any visual cues, while people with dyslexia or with low vision might have difficulty interpreting visual cues.
Defining key terms and specialized language will help people who are not familiar with the topic.
Providing the expansion of abbreviations and acronyms not only helps people who are not familiar with the abbreviation or acronym but can clarify which meaning of an abbreviation or acronym is appropriate to use. For example, the acronym "ADA" stands for both the American with Disabilities Act as well as the American Dental Association.
the content has been reviewed, taking into account the following strategies for evaluating the complexity of the content, applying them as appropriate.
use of sentence structures that increase understanding
such as active voice in languages where this form helps convey information
length of noun phrases
strings of no more than three or four nouns are easiest to understand
clarity of reference with pronouns and anaphoric expressions (these refer back to something already said in the text)
example of potential ambiguity: "Scientists study monkeys. They eat bananas."
correct use of conjunction forms and adverbs to make explicit the relationship between phrases or parts of the text
such as "and," "but," "furthermore," "not only"
complexity of verb tenses
do the tenses used in a document seem overly complicated?
intelligibility of verb phrases
familiarity of idioms or slang
logic in the order and flow of information
consequences of ambiguity or abstraction
improved readability of vertical lists might offer in place of long paragraphs of information
use of summaries to aid understanding
thoroughness in the explanation of instructions or required actions
consistency in the use of names and labels
clarity where the document:
addresses users
explains choices and options
labels options to get more information
instructs users how to modify selections in critical functions (such as how to delete an item from a shopping cart)
application of:
proper markup to highlight key information
goal-action structure for menu prompts
default settings (and the ease in re-establishing them)
two-step "select and confirm" processes to reduce accidental selections for critical functions
calculation assistance to reduce the need to calculate
testing with potential users for ease of accessibility
use of a controlled language
providing support for conversion into symbolic languages
adding non-text content to the site for key pages or sections specifically to make the site more understandable by users who cannot understand the text only version of the site.
Controlled languages use a restricted vocabulary taken from natural language. The purpose is to make texts easier to understand and translate. Standards generally limit words to a single meaning and prescribed part of speech. Complex syntax is avoided. Information about controlled language applications is available on the World Wide Web.
Non-text content includes images, text in raster images, image map regions, animations (e.g., animated GIFs), applets and programmatic objects, ASCII art, scripts, images used as list bullets, spacers, graphical buttons, sounds (played with or without user interaction), stand-alone audio files, audio tracks of video, and video.
Editorial Note: Is this definition adequate?
All users, especially those with cognitive, learning, and/or reading disabilities benefit from the use of clear and simple writing. This should not discourage you from expressing complex or technical ideas.
Using clear and simple language also benefits people whose first language differs from your own, including those people who communicate primarily in sign language.
Sounds, graphics, videos and animations can help make concepts presented in a Web site easier to understand, especially for people with cognitive, reading, or learning disabilities or those who are unfamiliar with the language of the text of the site.
Note:
Designers need to be cautious in deciding when to use illustrations. Reading a picture is probably a learned activity that is easier for some than others. Some users skip the pictures; others read only the pictures. Designers must also recognize that visual conventions are not universal and that individuals develop their own mental schema and expectations in interpreting visual information.
Example 1: a description of a process.
A page describes how to learn to play soccer. Each step in learning the fundamentals of the game is illustrated with a photograph of a player doing what is described in the text.
Example 2: a concrete concept.
The primary concept on a page is concrete. It is discussing Mt. Pinatubo. It includes both a description of the 1991 eruption as well as photos of the eruption and the aftermath. It links to another site that contains video and another site that contains a 3D simulation of what happened underneath the crust and within the volcano during the eruption.
Example 3: child's report of school trip.
A child went with her school on a trip to a bicycle manufacturing plant. She wrote a report for her family and friends to post to the Web. In the report, she includes the company logo as well as a picture of a bicycle on the assembly line. She links to the company Web site for more information. She includes photos she took of the plant.
Example 4: stock trading data.
A news site is comparing the performance of the economy from 3rd quarter of this year with 3rd quarter from the last 3 years. They compare prices of the most popular stocks. They present the data in a bar graph with a link to the raw data they used to create the bar graph.
Example 5: history of music.
A grandfather's hobby is listening to and playing music. He creates a Web site that includes examples of many different types of music and musical instruments. When describing specific types of music, he links to a short sound clip.
the content has been reviewed, taking into account the additional ideas listed below:
place navigation bars in a consistent location whenever possible
similar layout for user interface components should be used for sections or whole site
similar user interface components should be labeled with similar terminology
use headers consistently
use templates for consistent presentation of sections or whole site
pages with similar function should have similar appearance and layout
controls that look or sound the same should be designed to act the same
conventions likely to be familiar to the user should be followed
unusual user interface features or behaviors that are likely to confuse the first-time user should be described to the user before they are encountered
allow the user to select different page layout templates for presentation of pages. (e.g. 3 column, linear, adding extra orientation or navigation elements, etc.) [I#353]
Mechanisms that cause extreme changes in context include:
opening a new browser window unexpectedly and without any nonvisual cue (back button suddenly appears nonfunctional)
in an auditory presentation, the speaker changes with no visual cue and no notation in captions
captions that do not identify a change in speaker
Common user actions include:
mouse movements
key activation
link selection
use of browser navigation buttons (e.g. back and forward)
opening new browser windows
Common responses to user actions include:
loading a new page
exposing/concealing content based on mouse position or keyboard focus
displaying the contents of a menu (auditorily or visually)
displaying pop-up menus or windows
submitting a form
It is important that responses to user actions be predictable and sensible to the end user and that interactions are consistent, both throughout the site and with commonly used interaction metaphors used throughout the Web.
Individuals who are unable to detect extreme changes in context or may not realize that the context has changed are less likely to become disoriented while navigating a site. This applies to people in the following ways:
Individuals who are blind or have low vision may have difficulty knowing when a visual context change, such as a new window popping up, has occurred. In this case, warning users of context changes in advance minimizes confusion when the user discovers that the back button no longer behaves as expected.
Using captions to note changes in speaker is beneficial for individuals who are deaf or hard of hearing and who may be unable to discern changes in speaker for audio-only presentations.
Some individuals with low vision, with dyslexia and who have difficulty interpreting visual cues may benefit from additional cues in order to detect extreme changes in context.
Note:
Providing consistent and predictable responses to user actions is important feedback for the user. This lets them know that your site is working properly and encourages them to continue interacting with the content. When the user receives an unexpected response, they might conclude that something is wrong or broken. Some people might get so confused they will not be able to use your site.
Example 1: a form to deactivate pop-up windows.
A checkbox is provided on a page of links to let the user select whether they want the resultant pages to appear in new windows or not.
Example 2: a warning given before a pop-up window.
At the end of a news story, several links are provided for more information. At the beginning of each link is an icon of an arrow with the text equivalent, "Link will open in new window."
Example 3: frames that do not track history making the back button behave unexpectedly.
Example 4: forms.
Editorial Note: Some of these examples are very brief. Should they be expanded and clarified with further details?
A search engine is provided with a variety of search options for different skill levels and preferences. It includes a spell checker and offers "best guess" alternatives, query-by-example searches, and similarity searches.
Example 1: structural elements.
Throughout a Web site, structural elements are not used for purposes of presentation. Likewise, presentational elements are not used for purposes of structure.
Example 2: accessible API's.
A Java applet uses the accessibility API defined by the language. Refer to the IBM Guidelines for Writing Accessible Applications Using 100% Pure Java.
of at least two such implementations, it is true that the technologies and features on the required list have been supported by at least one prior version of the software.
Editorial Note: A definition of "widely available" should be added here to include something which is low cost and available in many?/most? countries/languages.
A technology is a
markup or programming language
application Programming Interface (API)
or communication protocol
A feature is a specific component of a technology, for example an element in a markup language or a function call in an Application Programming Interface. Typically, a given feature may only be available in specific versions of the technology, and thus may need to be noted explicitly in the required list.
Benefits of determining and documenting baseline user agent requirements:
Individuals can identify (either through site documentation or automatically through metadata) whether or not they are likely to be able to use a site. In conjunction with a search engine or a proxy server, this could be used to automatically filter out sites a user can not access or to automatically filter to the top sites that would be most usable.
Requiring sites to document their baseline will cause them to evaluate assumptions about user agents and will minimize the number of sites that are inadvertently inaccessible because they are unaware of backward compatibility issues.
Benefits of designing for backward compatibility:
Individuals who must use alternative browsing technologies and devices will be able to access the content.
Individuals who can not afford or otherwise do not have access to newer technologies also benefit from backward compatibility in that they will not need to purchase upgrades or equipment as often.
Example 1: an online store.
By documenting minimum user agent requirements, the store makes it possible for people using particular technologies to determine if they are going to have trouble using the store or its checkout mechanism without having to go through the entire process of shopping and checkout only to find out that they are unable to complete their transaction at the end. They can, therefore, shop at stores they can be successful at.
Example 2: an Intranet site.
A large company was concerned about the ability to address individuals at many diverse sites that have different technology bases. They have, therefore, created two versions of their content and documented the requirements for each, making it easy for individual sites to determine which version would work for their technologies.
the interface has been tested using a variety of assistive technologies and preferably real people with disabilities who use assistive technologies to determine that assistive technologies can access all information on the page or hidden within the page.
Editorial Note: It would be possible to comply with the checkpoint without carrying out tests (either with users or with assistive technologies). Conversely, it is possible to conduct tests, but still fail to meet the checkpoint (with respect to assistive technologies that were not tested, for example). Should this success criterion be deleted?
accessibility conventions of the markup or programming language (API's or specific markup) are used [I#331]
Authors who utilize technologies designed to support accessibility will:
encounter fewer challenges when implementing these guidelines
avoid the need to create custom solutions and workarounds to address accessibility concerns
avoid the need to provide accessible alternate versions for content rendered in a technology that does not fully address these guidelines
Individuals who rely on assistive technologies to access the Web will be able interact with the content.
Individuals who access the Web with older technologies or alternative browsing devices such as PDAs and cell phones also benefit from the inclusion of accessible alternatives to custom user interfaces.
Editorial Note: The WCAG WG has not tackled the definitions of the terms that we are using and acknowledges that we sometimes use terms inconsistently. We need to coordinate our terms and definitions with the WAI Glossary and are working on proposals for a variety of definitions. We have been looking at the UAAG 1.0 glossary and other glossaries within the W3C.
content that can be expressed accurately and unambiguously in a reasonable number of words (for example, diagrams, charts, illustrations, etc.) Content such as a musical performance or visual artwork is considered "content that can not be expressed in words," since this type of content relies heavily on the visual (or auditory) experience.[I#320]
A competitive activity is an activity where timing is an essential part of the design of the activity. Removal of the time element would change the performance of the participants. Versions of the activity (e.g. test) that have no time basis or time limits might be preferred and may be required for some venues but this would require a complete redesign of the activity (e.g. test) and may change the character and validation methodology and would therefore not fall under these guidelines.
Content is considered complex if the relationships between pieces of information are not easy to figure out. If the presentation of the information is intended to highlight trends or relationships between concepts, these should be explicitly stated in the summary.
Examples of complex information:
data tables,
concepts that are esoteric or difficult to understand,
content that involves several layers.
Content is the information or meaning and function.[I#339]
Controlled languages use a restricted vocabulary taken from natural language. The purpose is to make texts easier to understand and translate. Standards generally limit words to a single meaning and prescribed part of speech. Complex syntax is avoided. Information about controlled language applications is available on the World Wide Web.
A feature is a specific component of a technology, for example an element in a markup language or a function call in an Application Programming Interface. Typically, a given feature may only be available in specific versions of the technology, and thus may need to be noted explicitly in the required list.
Functionality is the function or purpose of the content. The functionality or purpose of content can include presentation of information, as well as any other functionality such as data collection, securing a response from the user, providing user experience, testing, confirmation, purchasing, etc.[I#338]
A keyboard interface is the point where the application accepts any input that would come from the keyboard (or optional keyboard).
Mechanisms that cause extreme changes in context include:
opening a new browser window unexpectedly and without any nonvisual cue (back button suddenly appears nonfunctional)
in an auditory presentation, the speaker changes with no visual cue and no notation in captions
captions that do not identify a change in speaker
Common user actions include:
mouse movements
key activation
link selection
use of browser navigation buttons (e.g. back and forward)
opening new browser windows
Common responses to user actions include:
loading a new page
exposing/concealing content based on mouse position or keyboard focus
displaying the contents of a menu (auditorily or visually)
displaying pop-up menus or windows
submitting a form
It is important that responses to user actions be predictable and sensible to the end user and that interactions are consistent, both throughout the site and with commonly used interaction metaphors used throughout the Web.
Media equivalents present essential audio information visually (captions) and essential video information auditorily (audio descriptions).
[Definition: captions are text equivalents of auditory information from speech, sound effects, and ambient sounds that are synchronized with the multimedia presentation.]
[Definition: audio descriptions are equivalents of visual information from actions, body language, graphics, and scene changes that are voiced (either by a human or a speech synthesizer) and synchronized with the multimedia presentation.]
Natural languages are those used by humans to communicate, including spoken, written, and signed languages.
non-text content includes but is not limited to images, text in raster images, image map regions, animations (e.g., animated GIFs), ASCII art, images used as list bullets, spacers, graphical buttons, sounds (played with or without user interaction), stand-alone audio files, audio tracks of video, and video.
Note:
Scripts, applets, and programmatic objects are not covered in this definition and are addressed in checkpoint 4.3.[I#333]
Non-text content includes images, text in raster images, image map regions, animations (e.g., animated GIFs), applets and programmatic objects, ASCII art, scripts, images used as list bullets, spacers, graphical buttons, sounds (played with or without user interaction), stand-alone audio files, audio tracks of video, and video.
The term operable includes the concept of efficiency. That is, it implies that the device can be operated from the keyboard in a reasonably efficient fashion. Using MouseKeys or having to tab dozens of times to move through a small section of a document or page, or other unreasonably inefficient keyboard access would not qualify. If a document has a very large number of links, some mechanism other than tabbing through them one at a time needs to be provided. This might include provision of headers (for header navigation), the use of skip navigation, links, etc.[I#346]
Presentation is the rendering of the content and structure in a form that can be sensed by the user.
Real-time events are those that are based on the occurrence of events in real-time where the events are not under the control of the author.
A site navigation mechanism is a mechanism for easily orienting and moving about within the site. Site navigation mechanisms include but are not limited to:
A home page with hyperlinks on it and subsequent pages that link to the other pages at the site
site map(s)
search engine(s)
expanding outline(s)
dynamic fisheye views showing all linked pages or topics related to any page.
3-D virtual representations of site content
Structure includes both hierarchical structure of the content and non-hierarchical relationships such as cross-references, or the correspondence between header and data cells in a table.
The structure of content represents changes in context. For example,
A book is divided into chapters, paragraphs, lists, etc. Chapter titles help the reader anticipate the meaning of the following paragraphs. Lists clearly indicate separate, yet related ideas. All of these divisions help the reader anticipate changes in context.
A bicycle is divided into wheels and a frame. Further, a wheel is divided into a tire and a rim. In an image of the bicycle, one group of circles and lines becomes "wheel" while another group becomes "frame."
A technology is a
markup or programming language
application Programming Interface (API)
or communication protocol
A text equivalent
serves the same function as the non-text content was intended to serve.
communicates the same information as the non-text content was intended to convey.
may contain structured content or metadata.
Note:
Text-equivalents should be easily convertible to braille or speech, displayed in a larger font or different colors, fed to language translators or abstracting software, etc.
A time-dependent presentation is a presentation that
is composed of synchronized audio and visual tracks (e.g., a movie), OR
requires the user to respond interactively at specific times in the presentation.
Content might be unfamiliar if you are using terms specific to a particular community. For example, many of the terms used in this document are specific to the disability community.
Since the release of WCAG 1.0 in May 1999, the WCAG Working Group has received feedback on priorities of checkpoints, the usability of the set of documents, and requests for clarifications on the meaning of specific checkpoints and what is needed to satisfy them. Thus, it is intended that WCAG 2.0, when it eventually becomes a W3C Recommendation:
will be more efficiently organized,
may adjust the priority of some checkpoints,
may modify, remove, or add requirements due to changes in Web technologies since the publication of WCAG 1.0,
will incorporate the Errata from WCAG 1.0,
will reflect the experience gained in implementing WCAG 1.0.
For a checkpoint by checkpoint comparison, refer to the Checkpoint Mapping Between WCAG 1.0 and the WCAG 2.0 Working Draft.
We hope that WCAG 2.0 will have several improvements over WCAG 1.0. While the primary goal of WCAG 2.0 is the same as WCAG 1.0 (to promote accessibility of Web content) additional goals for WCAG 2.0 include improvements that will:
Ensure that requirements may be applied across technologies
Ensure that the conformance requirements are clear
Ensure that the deliverables are easy to use
Write to a more diverse audience
Clearly identify who benefits from accessible content
Ensure that the revision is "backward compatible" with WCAG 1.0
For more information about the intended improvements in WCAG 2.0 Working Draft, please refer to Requirements for WCAG 2.0.