- From: Bert Bos <bert@w3.org>
- Date: Wed, 27 Sep 2006 12:08:17 +0200
- To: www-style@w3.org
- Cc: Sergey Ignatchenko <sergey@ignatchenko.com>
On Friday 22 September 2006 01:33, Sergey Ignatchenko wrote:
> Sorry if asking about obvious things, but I have certain problems
> understanding the following aspects of CSS2.1 4.1.1 and Appendix G.
> Any advise will be appreciated.
The working group is reluctant to make changes at this late stage, even
if they are just to clarify the text. The existing text, though
sometimes difficult to read, is believed to be correct, while any
"clarified" text might actually introduce mistakes.
That said, here are the responses in detail:
>
> 1. 4.1.1 says:
> "UNICODE-RANGE U\+[0-9A-F?]{1,6}(-[0-9A-F]{1,6})?"
> should the first question mark really be here? what would be the
> meaning of construct "U+A?6??C"? Also the fate of UNICODE-RANGE is
> not clear at all; it doesn't seem to be mentioned anywhere else in
> the document (if it is a legacy from CSS1, which seems the most
> likely guess, it would be a good thing to clarify it's potential use
> or uselessness).
The UNICODE-RANGE token is used in the WebFonts module. WebFonts are no
longer part of level 2, but they are still part of CSS. Chapter 4 is
meant to define the syntax CSS for all times: different levels and
different profiles may use different features of CSS, but the parsing
rules are always the same.
So you are right that UNICODE-RANGE is not used in CSS 2.1, but we
cannot remove the token.
You are also right that the UNICODE-RANGE token allows to express
Unicode ranges that make no sense. E.g., "U+?0" is every 16th Latin-1
character, i.e., [ 0@P`p°ÀÐàð]. But there is no harm in such
expressions either and it makes parsing the token easier.
>
> 2. Both 4.1.1 and Appendix G.2 say that 2nd (supposedly unquoted?)
> form of URI is described as follows:
> "url\({w}([!#$%&*-~]|{nonascii}|{escape})*{w}\)"
> some set of characters seems to be missing here; for example, I don't
> see how example from 4.3.4
> ("url(http://www.example.com/redball.png)") can fit into this
> description (as well as another description "url\({w}{string}{w}\)").
I admit that you have to read the definition very carefully, but it is
actually correct. There is a "-" between the "*" and the "~" in the set
of characters and that means that all characters with code points
between "*" and "~" are included in the set. And that happens to cover
nearly all ASCII characters.
>
> 3. In G.2 YACC/FLEX grammar 'S' is defined as a single space ("[
> \t\r\n\f]"), unlike 'S' in 4.1.1 ("[ \t\r\n\f]+"). While not a big
> deal, it looks a bit confusing.
As you said, neither of them is wrong. But we looked carefully at the
grammars and decided to take the risk and change the token S in the
appendix. In the next draft it will be "[ \t\r\n\f]+", just like in
chapter 4.
For the CSS WG,
Bert
--
Bert Bos ( W 3 C ) http://www.w3.org/
http://www.w3.org/people/bos W3C/ERCIM
bert@w3.org 2004 Rt des Lucioles / BP 93
+33 (0)4 92 38 76 92 06902 Sophia Antipolis Cedex, France
Received on Wednesday, 27 September 2006 10:08:33 UTC