
[Bug 14032] New: Attribute values cannot contain ambiguous ampersands

From: <bugzilla@jessica.w3.org>
Date: Sun, 04 Sep 2011 22:31:12 +0000
To: public-html-bugzilla@w3.org
Message-ID: <bug-14032-2486@http.www.w3.org/Bugs/Public/>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=14032

           Summary: Attribute values cannot contain ambiguous ampersands
           Product: HTML WG
           Version: unspecified
          Platform: All
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HTML5: The Markup Language (editor: Michael(tm) Smith)
        AssignedTo: mike@w3.org
        ReportedBy: chealer@gmail.com
         QAContact: public-html-bugzilla@w3.org
                CC: public-html-wg-issue-tracking@w3.org,
                    public-html@w3.org


The specification states, in http://dev.w3.org/html5/spec/syntax.html#attributes-0 :

Attribute values are a mixture of text and character references, except with
the additional restriction that the text cannot contain an ambiguous ampersand.

The reference does not mention this restriction. It says only:

attribute values can contain text and character references, with additional
restrictions depending on whether they are unquoted attribute values,
single-quoted attribute values, or double-quoted attribute values. Also, the
HTML elements section of this reference describes further restrictions on the
allowed values of particular attributes, and attributes must have values that
conform to those restrictions.

http://dev.w3.org/html5/markup/syntax.html#syntax-attributes


Note also that the reference's definition of an ambiguous ampersand differs
from the specification's. The specification says:

An ambiguous ampersand is a U+0026 AMPERSAND character (&) that is followed by
one or more characters in the range U+0030 DIGIT ZERO (0) to U+0039 DIGIT NINE
(9), U+0061 LATIN SMALL LETTER A to U+007A LATIN SMALL LETTER Z, and U+0041
LATIN CAPITAL LETTER A to U+005A LATIN CAPITAL LETTER Z, followed by a U+003B
SEMICOLON character (;), where these characters do not match any of the names
given in the named character references section.
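
To make the quoted definition concrete, here is a minimal Python sketch of how
I read it. The function name is hypothetical, and I am assuming that Python's
html.entities.html5 table is an adequate stand-in for the specification's named
character references section.

```python
import re
from html.entities import html5  # keys look like "amp;", "lt;", "copy;"

# "&", then one or more ASCII digits or letters, then ";".
_CANDIDATE = re.compile(r"&([0-9A-Za-z]+);")

def spec_ambiguous_ampersands(text):
    """Return the "&name;" substrings whose name is not a known reference."""
    return [
        m.group(0)
        for m in _CANDIDATE.finditer(text)
        if m.group(1) + ";" not in html5  # assumption: html5 mirrors the spec's list
    ]

print(spec_ambiguous_ampersands("x &bogus; y &lt; z"))
```

Under this reading, a value such as "a=1&b=2" contains no ambiguous ampersand,
because the ampersand is never followed by a semicolon-terminated name.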

The reference says:

An ambiguous ampersand is an "&" character that is followed by some text other
than a space character, a "<", character, or another "&" character.
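
For comparison, a minimal Python sketch of the reference's sentence, read
literally. The function name is hypothetical. I am assuming "space character"
means the HTML space characters (space, tab, LF, FF, CR), and the sketch makes
no attempt to exclude ampersands that begin valid character references.

```python
import re

# "&" followed by a character that is not an HTML space character, "<", or "&".
_REFERENCE_AMBIGUOUS = re.compile(r"&(?=[^ \t\n\f\r<&])")

def reference_ambiguous_offsets(text):
    """Return the offsets of "&" characters matched by the literal reading."""
    return [m.start() for m in _REFERENCE_AMBIGUOUS.finditer(text)]

print(reference_ambiguous_offsets("a=1&b=2"))
```

Read this way, the ampersand in "a=1&b=2" is flagged even though no semicolon
follows it, which the specification's definition quoted earlier would not cover.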

Received on Sunday, 4 September 2011 22:31:13 UTC
