[Bug 17514] New: URI token should be agnostic to escaping the characters u, r, l from bugzilla@jessica.w3.org on 2012-06-16 (public-css-bugzilla@w3.org from June 2012)

From: <bugzilla@jessica.w3.org>
Date: Sat, 16 Jun 2012 14:28:21 +0000
To: public-css-bugzilla@w3.org
Message-ID: <bug-17514-5148@http.www.w3.org/Bugs/Public/>

https://www.w3.org/Bugs/Public/show_bug.cgi?id=17514

           Summary: URI token should be agnostic to escaping the
                    characters u, r, l
           Product: CSS
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: CSS Level 2
        AssignedTo: bert@w3.org
        ReportedBy: antonsforums@yahoo.co.uk
         QAContact: public-css-bugzilla@w3.org


4.1.1 (Tokenization) defines the URI token as:

  # URI    url\({w}{string}{w}\)
  #       |url\({w}([!#$%&*-\[\]-~]|{nonascii}|{escape})*{w}\)

(and similarly for BAD_URI), whilst G.1 gives:

  # "url("{w}{string}{w}")"   {return URI;}
  # "url("{w}{url}{w}")"    {return URI;}

(and similarly for BAD_URI).

This means that if you escape any of the characters 'u', 'r', 'l' in a property
value intended to match the <uri> value type, then what you might have expected
to have tokenized as URI is actually tokenized as FUNCTION.

However, this doesn't match UAs; Trident, Gecko and Presto all allow the
characters to be escaped and still invoke the normal URI token parsing.

The spec should be changed to match reality.

Conversation begins:
Bug report:
http://lists.w3.org/Archives/Public/www-style/2012May/0327.html

-- 
Configure bugmail: https://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.

Received on Saturday, 16 June 2012 14:28:23 UTC