- From: by way of Martin Duerst <etanwexler@comcast.net>
- Date: Sat, 29 Nov 2003 09:14:46 -0500
- To: www-international@w3.org
Richard Ishida wrote to <mailto:www-international@w3.org> on 27 November 2003 in "New test page: UTF-8 signature / BOM" (<mid:004c01c3b4fe$6e7414e0$6601a8c0@w3c40upc3ma3j2>): >[ Note that I've also seen the first line or so of external CSS style >sheets fail if a utf-8 signature is present. If I can remember how to >replicate the failure, I'll write another test file to cover that. ] The effect that you observe with Cascading Style Sheets is not a failure according to the CSS2 Recommendation. In short, the byte order mark (U+FEFF zero width no-break space) counts as an identifier component. CSS level 2 specifies that any character from U+00A1 to U+FFFFFF can appear bare in an identifier or starting an identifier [CAC2]. Level 2.1 (a work in progress as of 27 November 2003) has the same allowance [CAC21]. Suppose I have a single-ruleset style sheet: td { padding: 1ex; } Now suppose that my CSS editor prepends a BOM to the style sheet. According to specification, the effect should be the same as if the style sheet were: \FEFFtd { padding: 1ex; } In other words, the CSS engine has a selector that matches against any element whose element-type name is the sequence U+FEFF, U+0074, U+0064. The selector must not match against "td" elements. The syntax module in level 3 (a work in progress as of 27 November 2003) [SYN3] is adapting to the times by allowing an initial U+FEFF as an encoding signature rather than as an identifier character: "A byte order mark (BOM), as described in section 2.7 of [UNICODE310], that begins the sequence of characters should not be considered, for purposes of applying the grammar below, as a part of the style sheet." CSS level 1 didn't allow U+FEFF to appear in style sheets (although its representation through numeric escapes was permitted) [SYN1]. This is mostly a historical footnote; CSS level 1, although officially a Recommendation, has the effective status of a superseded Candidate Recommendation. [CAC2] Bert Bos; H虧on Wium Lie; Chris Lilley; Ian Jacobs. "Characters and case", section 4.1.3 of CSS level 2 specification. W3C Recommendation. 12 May 1998. <http://www.w3.org/TR/REC-CSS2/syndata.html#q4>. [CAC21] Bert Bos; Tantek ヌelik; Ian Hickson; H虧on Wium Lie. "Characters and case", section 4.1.3 of CSS level 2.1 specification. W3C Working Draft. 15 September 2003. <http://www.w3.org/TR/2003/WD-CSS21-20030915/syndata.html#q6> [SYN3] L. David Baron, editor. "CSS style sheet representation", section 3 of CSS3 syntax module. W3C Working Draft. 13 August 2003. <http://www.w3.org/TR/2003/WD-css3-syntax-20030813/#css-style>. [SYN1] H虧on Wium Lie;Bert Bos. "CSS1 grammar", Appendix B of revised CSS1 specification. W3C Recommendation. 11 January 1999. <http://www.w3.org/TR/REC-CSS1#appendix-b>.
Received on Saturday, 29 November 2003 10:10:43 UTC