- From: Eric Hansen <ehansen7@hotmail.com>
- Date: Tue, 19 Sep 2000 06:41:51 EDT
- To: w3c-wai-ua@w3.org
- Cc: ehansen@ets.org
To: UA List (w3c-wai-ua@w3.org)
From: Eric Hansen
Re: Non-Text Content, Text Element, Etc.
Suggestion 1: Provide a definition of "non-text content", etc.
In keeping with the working group's resolution in the last teleconference
[1] to "Adopt a definition of non-text content, text element, non-text
element", following is my wording.
New:
"Non-Text Content, Text Content, Non-Text Element, Text Element"
"The term non-text content in this document refers to _content_ that is
composed of one or more "non-text elements". Per checkpoint 1.1 of WCAG 1.0,
the content author must ensure that there is a "text equivalent" for each
non-text element in author-supplied content. Similarly, the developer of a
user agent must ensure that a text equivalent is available for any non-text
element produced by the user agent for the user (see checkpoint 1.5). The
term "text content" in this document refers to content that is composed of
one or more "text elements.""
"A "text element" is an element that, when rendered, is understandable in
_each_ of the three modes: (1) visually-displayed text (for person who is
deaf and adept in reading visually-displayed text); (2) synthesized speech
(for a person who is blind and adept in use of synthesized speech); and (3)
braille (for a person who is deaf-blind and adept at reading braille). A
text element may contain markup for structure (e.g., heading levels), and
style (e.g., font size or color), and so on. However, the essential function
of the text element should be retained even if style information happens to
be lost in rendering."
"A non-text element is an element that _fails_ to be understandable when
rendered in _any_ of three modes to their respective disability audiences."
"Note that the terms text element and non-text element are defined by the
characteristics of their output (rendering) rather than those of their
input. For example, in principle, a text equivalent can be generated or
encoded in any fashion as long as it has the proper output characteristics.
[EH Question: Do we need to say: "However, in many cases, text elements are
encoded as text -- with or without markup."? Or do we need, for practical or
other reasons, to otherwise constrain the format of the text equivalent?] A
text equivalent may be understood as "pre-rendering" content in contrast to
the "post-rendering" content that it produces (visually-displayed text,
synthesized speech, braille).
A _text equivalent_ is a text element that, when rendered, serves
essentially the same function as some other content (e.g., "primary"
content) does for a person without any disability (see definition of
_alternative equivalents_).
Comment 1:
I think that it is imprecise and misleading to define text equivalents as
"unrendered content" and the content that they produce (visually-displayed
text, synthesized speech, and braille) as the "rendered content". While
those terms are often correct in particular instances, they don't rise to
the level of defining characteristics. I think that the terms "pre-rendering
content" and "post-rendering content", while perhaps a bit unfamiliar, are
correct.
Comment 2:
I think that the definition of text element could very appropriately include
the requirements of WCAG 1.0 checkpoint 14.1 ("Use the clearest and simplest
language appropriate for a site's content. [Priority 1]"). For example, if a
text equivalent fails to adhere to checkpoint 14.1, the whole document fails
to conform at the WCAG 1.0. However, I am not sure that it is essential in
this context.
Comment 3:
Another possibly essential part of the definition could be as follows: "The
'visually-displayed text' must be composed of one or more characters from
any of the standard [?] character sets. That is, if one disregards stylistic
features of the text, the visually displayed text will look like standard
graphical representations of the characters (or glyphs) in the character
set." One could also affirm similar things for braille output, thought I
don't think that we have to do this for this document. See suggestion this
memo regarding the definition of "text".
Comment 4:
I don't think that the term "text content" now appears in the document other
than in this definition. However, it is found in WCAG 1.0 in a way that
seems consistent with the proposed definition ("Text content can be
presented to the user as synthesized speech, braille, and visually-displayed
text. Each of these three mechanisms uses a different sense -- ears for
synthesized speech, tactile for braille, and eyes for visually-displayed
text -- making the information accessible to groups representing a variety
of sensory and other disabilities.").
Comment 5:
Under the proposed definitions, user agents must expect that _markup_ may be
found in text equivalents. However, I don't think that we normally think of
values of the "alt" attribute as containing markup. Should we specify that
user agents need to be ready to recognize the contents either as: plain
text, as marked up text, as a URI of a file to open or execute? How can they
be expected to recognize them as such?
====
Suggestion 2: Fix checkpoint 7.5.
Change the term "rendered text content" simply to "text rendered from the
DOM". The term "text content" has a special meaning (i.e., content that is
composed of one or more text elements).
Old (1 September 2000):
"7.5 Allow the user to search for rendered text content, including rendered
text equivalents. Allow the user to start a forward search from a location
in content selected or focused by the user. After a match, allow searching
from location of the match. Provide a case-insensitive search option when
applicable to the natural language of text. [Priority 2]"
New:
"7.5 Allow the user to search for text rendered from the DOM, including
rendered text equivalents. Allow the user to start a forward search from a
location in content selected or focused by the user. After a match, allow
searching from location of the match. Provide a case-insensitive search
option when applicable to the natural language of text. [Priority 2]"
====
Suggestion 3: Fix the definition of "Configure and Control".
Change the term "all text content" simply to "all text from the DOM". The
term "text content" has a special meaning (i.e., content that is composed of
one or more text elements).
Old (1 September 2000):
"For example, users may configure the user agent to apply the same font
family across Web resources, so that all text content is displayed by
default using that font family. Or, the user may wish to configure the
rendering of a particular element type, which may be done through style
sheets."
New:
"For example, users may configure the user agent to apply the same font
family across Web resources, so that all text from the DOM is displayed by
default using that font family. Or, the user may wish to configure the
rendering of a particular element type, which may be done through style
sheets."
====
Suggestion 4: Fix the definition of "Content".
Old (1 September 2000):
"Content"
"In this specification, the term "content" is used in two ways:"
"1. Content refers to the document object as a whole or in parts. Phrases
such as "content type", "text content", and "language of content" refer to
this usage. When used in this sense, the term content encompasses equivalent
alternatives. Refer also to the definition of rendered content. and other
accessibility information."
"2. Content refers to the content of an HTML or XML element, in the sense
employed by the XML 1.0 specification ([XML], section 3.1): "The text
between the start-tag and end-tag is called the element's content." Context
should indicate that the term content is being used in this sense."
New:
"Content"
"In this specification, the term "content" is used in three ways:"
"1. Content refers to the DOM document object as a whole or in parts.
Phrases such as "content type" and "language of content" refer to this
usage. When used in this sense, the term content encompasses equivalent
alternatives. Refer also to the definition of rendered content <NOTE PERIOD
DELETED> and other accessibility information."
"2. Content refers to the content of an HTML or XML element, in the sense
employed by the XML 1.0 specification ([XML], section 3.1): "The text
between the start-tag and end-tag is called the element's content." Context
should indicate that the term content is being used in this sense."
"3. Content is used in the context of the phrase "non-text content" and
"text content". See definition of _non-text content_."
Comment 1:
As far as I know, in the current document, all instances of meaning #3 are
also instances of meaning #1. However, I think that the definitions must be
separate.
Comment 2:
I think we need to make clear that when we refer to "content", we are
referring to the DOM (i.e., DOM2), not to any other kind of "document
object". (No Subject can conform unless it exports the DOM2 DOM.)
Comment 3:
I wonder if the term "content type" really should be "media type". Or does
DOM2 specifically refer to "content types"?
====
Suggestion 5: Fix the definition of Document Object.
Old (1 September 2000):
Document Object, Document Object Model
The document object is the user agent's representation of data (e.g., a
document). This data generally comes from the document source, but may also
be generated (from style sheets, scripts, transformations, etc.) or produced
as a result of preferences set within the user agent. Some data that is part
of the document object is routinely rendered (e.g., in HTML, what appears
between the start and end tags of elements and the values of attributes such
as "alt", "title", and "summary"). Other parts of the document object are
generally processed invisibly by the user agent, such as DTD-defined names
of element types and attributes, and other attribute values such as "href",
"id", etc. These guidelines require that users have access to both types of
data through the user interface.
A document object model is the abstraction that governs the construction of
the user agent's document object. The document object model employed by
different user agents will vary in implementation and sometimes in scope.
Nevertheless, this document calls for developers of user agents to adhere to
the W3C Document Object Model (DOM), which specifies a standard interface
for accessing HTML and XML content. This standard interface allows authors
to access and modify the document with a scripting language (e.g.,
JavaScript) in a consistent manner across different scripting languages. As
a standard interface, use of a W3C DOM makes it easier not just for authors
but for assistive technology developers to extract information and render it
in ways most suited to the needs of particular users. The relevant W3C DOM
Recommendations are listed in the references. In this specification, the
acronym "DOM" refers to the W3C DOM.
New:
Document Object, Document Object Model
In general usage, the term "document object" refers to the user agent's
representation of data (e.g., a document). This data generally comes from
the document source, but may also be generated (from style sheets, scripts,
transformations, etc.) or produced as a result of preferences set within the
user agent. Some data that is part of the document object is routinely
rendered (e.g., in HTML, what appears between the start and end tags of
elements and the values of attributes such as "alt", "title", and
"summary"). Other parts of the document object are generally processed
invisibly by the user agent, such as DTD-defined names of element types and
attributes, and other attribute values such as "href", "id", etc. These
guidelines require that users have access to both types of data through the
user interface.
A "document object model" is the abstraction that governs the construction
of the user agent's document object. The document object model employed by
different user agents will vary in implementation and sometimes in scope.
In the context of requirements of this document, the terms "document object"
and "document object model" (DOM) refer specifically to the W3C DOM2, which
specifies a standard interface for accessing HTML and XML content. This
standard interface allows authors to access and modify the document with a
scripting language (e.g., JavaScript) in a consistent manner across
different scripting languages. As a standard interface, use of a W3C DOM
makes it easier not just for authors but for assistive technology developers
to extract information and render it in ways most suited to the needs of
particular users. The relevant W3C DOM Recommendations are listed in the
references.
====
Suggestion 6: Fix checkpoint 6.1.
Make clear which conformance levels of WCAG 1.0 and ATAG 1.0 are intended.
Is it a Priority 1 UAAG requirement to ensure that features that support
_all three_ WCAG and ATAG priority levels are implemented. I am not sure
whether or not I think that all three levels is too stringent. (I can see
that this is different than the case of documentation in which we decided
the double-A WCAG conformance was adequate.) But in any case, I think that
there needs to be a note indicating which priority levels are intended.
Old (1 September 2000):
"6.1 Implement the accessibility features of all supported specifications
(markup languages, style sheet languages, metadata languages, graphics
formats, etc.). Accessibility features are those identified in the
specification and those features of the specification that support
requirements of the "Web Content Accessibility Guidelines 1.0" [WCAG10], the
"Authoring Tool Accessibility Guidelines 1.0" [ATAG10], and the current
document. [Priority 1]"
"Note: This checkpoint applies to all specifications, not just W3C
specifications. The Techniques document [UAAG10-TECHS] provides information
about the accessibility features of some specifications, including W3C
specifications."
"Techniques for checkpoint 6.1"
====
Suggestion 7: Add a definition of "text".
Per our earlier discussion [1], there should be a definition of "text".
I think that we need to be very thoughtful and deliberate about adding such
a definition. If we specify some set of character sets, then what do we mean
-- (a) that those sets must _all_ be supported or (b) that user agents are
_not required_ to support anything outside of them or (c) that they _cannot_
support anything outside them?
If we name a specific set of character sets as what we mean, then we need to
carefully examine the contexts of its use. I am not sure how fully various
character sets are supported in braille and synthesized speech. Perhaps we
need not have the definition specifically refer to braille and synthesized
speech renderings.
Without necessary naming a particular set of character sets, it might be
valuable to simply make clear that when we refer to text we are referring to
the stuff of _character sets_ rather than to things like sign language
videos. (The term 'text' is sometimes used outside the document to encompass
such divergent things.)
====
Suggestion 8: Confirm the adequacy of the word "content" as used on
checkpoints 2.5, 2.6, and 3.8.
I believe that is the foregoing changes are made, then the current wording
of checkpoints 2.5, 2.6, and 3.8 appears adequate.
[1] http://lists.w3.org/Archives/Public/w3c-wai-ua/2000JulSep/0387.html
======================
_________________________________________________________________________
Get Your Private, Free E-mail from MSN Hotmail at http://www.hotmail.com.
Share information about yourself, create your own public profile at
http://profiles.msn.com.
Received on Tuesday, 19 September 2000 06:42:23 UTC