- From: Ian Hickson via cvs-syncmail <cvsmail@w3.org>
- Date: Tue, 10 Jun 2008 21:32:40 +0000
- To: public-html-commits@w3.org
Update of /sources/public/html5/spec
In directory hutz:/tmp/cvs-serv13212
Modified Files:
Overview.html
Log Message:
Remove U+000B LINE TABULATION (VT) from the list of characters treated like a space character in HTML, for consistency with CSS. This is incompatible with IE7, but apparently IE has all kinds of complications here and in any case IE8 doesn't treat U+000B as a space character. (whatwg r1738)
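As a rough illustration of the change described in the log message, the revised set of space characters and a "skip space characters" step could be sketched in Python as follows. This is a minimal sketch, not text from the specification: the names SPACE_CHARACTERS and skip_space_characters are hypothetical and chosen only for this example.

```python
# Illustrative sketch only; the names below are assumptions, not spec text.
# After this commit the space characters are U+0020, U+0009, U+000A,
# U+000C and U+000D. U+000B LINE TABULATION is no longer in the set.
SPACE_CHARACTERS = frozenset("\u0020\u0009\u000A\u000C\u000D")


def skip_space_characters(s: str, position: int = 0) -> int:
    """Return the index of the first character at or after `position`
    that is not a space character (or len(s) if there is none)."""
    while position < len(s) and s[position] in SPACE_CHARACTERS:
        position += 1
    return position
```

Under this sketch, a string that starts with U+000B is not skipped at all, because U+000B is now treated like any other non-space character.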
Index: Overview.html
===================================================================
RCS file: /sources/public/html5/spec/Overview.html,v
retrieving revision 1.926
retrieving revision 1.927
diff -u -d -r1.926 -r1.927
--- Overview.html 10 Jun 2008 21:20:02 -0000 1.926
+++ Overview.html 10 Jun 2008 21:32:37 -0000 1.927
@@ -2453,13 +2453,7 @@
<dd>
<p>Implementations must support some version of the Media Queries
- language. However, when applying the rules of the Media Queries
- specification to media queries found in content attributes of <a
- href="#html-elements">HTML elements</a>, user agents must act as if all
- U+000B LINE TABULATION characters in the attribute were in fact U+0020
- SPACE characters. This is required to provide a consistent processing of
- <a href="#space" title="space character">space characters</a> in HTML.
- <a href="#references">[MQ]</a></p>
+ language. <a href="#references">[MQ]</a></p>
</dl>
<p>This specification does not require support of any particular network
@@ -4570,8 +4564,8 @@
<p>The <dfn id=space title="space character">space characters</dfn>, for
the purposes of this specification, are U+0020 SPACE, U+0009 CHARACTER
- TABULATION (tab), U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), and U+000D CARRIAGE RETURN (CR).
+ TABULATION (tab), U+000A LINE FEED (LF), U+000C FORM FEED (FF), and U+000D
+ CARRIAGE RETURN (CR).
<p>Some of the micro-parsers described below follow the pattern of having
an <var title="">input</var> variable that holds the string being parsed,
@@ -8439,11 +8433,11 @@
<p>An <dfn id=ascii-compatible>ASCII-compatible character encoding</dfn> is
one that is a superset of US-ASCII (specifically, ANSI_X3.4-1968) for
- bytes in the range 0x09 - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F,
- 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any character
- sets we want to support do things outside that range? -->.
- <!-- XXX
- #refs RFC1345 ? -->
+ bytes in the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
+ 0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
+ character sets we want to support do things outside that range?
+ -->.
+ <!-- XXX #refs RFC1345 ? -->
<p>Authors should not use JIS_X0212-1990, x-JIS0208, and encodings based on
EBCDIC. Authors should not use UTF-32. Authors must not use the CESU-8,
@@ -33490,14 +33484,16 @@
<p>Otherwise, if any of the first <var title="">n</var> bytes of the
resource are in one of the following byte ranges:</p>
<!-- This byte list is based on RFC 2046 Section 4.1.2. Characters
- in the range 0x00-0X1F, with the exception of 0x09 - 0x0D (ASCII
- for TAB, LF, VT, FF, and CR), and character 0x1B (reportedly used
- by some encodings as a shift escape), are invalid. Thus, if we see
- them, we assume it's not text. -->
+ in the range 0x00-0x1F, with the exception of 0x09, 0x0A, 0x0C,
+ 0x0D (ASCII for TAB, LF, FF, and CR), and character 0x1B
+ (reportedly used by some encodings as a shift escape), are
+ invalid. Thus, if we see them, we assume it's not text. -->
<ul class=brief>
<li> 0x00 - 0x08
+ <li> 0x0B
+
<li> 0x0E - 0x1A
<li> 0x1C - 0x1F
@@ -33597,10 +33593,10 @@
whitespace to be skipped when sniffing for a type signature.</p>
<p>If the <var title="">index<sub>stream</sub></var>th byte of the
- stream is one of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
- VT), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space), then
- increment only the <var title="">index<sub>stream</sub></var> to
- the next byte in the byte stream.</p>
+ stream is one of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+ FF), 0x0D (ASCII CR), or 0x20 (ASCII space), then increment only
+ the <var title="">index<sub>stream</sub></var> to the next byte in
+ the byte stream.</p>
<p>Otherwise, increment only the <var
title="">index<sub>pattern</sub></var> to the next byte in the
@@ -34063,16 +34059,16 @@
such match is found, return nothing.</p>
<li>
- <p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters
- that immediately follow the word 'charset' (there might not be any).
+ <p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020 characters that
+ immediately follow the word 'charset' (there might not be any).
<li>
<p>If the next character is not a U+003D EQUALS SIGN ('='), return
nothing.
<li>
- <p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters
- that immediately follow the equals sign (there might not be any).
+ <p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020 characters that
+ immediately follow the equals sign (there might not be any).
<li>
<p>Process the next character as follows:</p>
@@ -34101,7 +34097,7 @@
<dd>
<p>Return the string from this character to the first U+0009, U+000A,
- U+000B, U+000C, U+000D, U+0020, or U+003B character or the end of <var
+ U+000C, U+000D, U+0020, or U+003B character or the end of <var
title="">s</var>, whichever comes first.
</dl>
</ol>
@@ -42047,10 +42043,9 @@
of the string "<code title=""></</code>" (U+003C LESS-THAN SIGN, U+002F
SOLIDUS) followed by characters that case-insensitively<!--ASCII--> match
the tag name of the element followed by one of U+0009 CHARACTER
- TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
- FEED (FF), U+0020 SPACE, U+003E GREATER-THAN SIGN (>), or U+002F SOLIDUS
- (/), unless that string is part of an <a href="#escaping"
- title=syntax-escape>escaping text span</a>.
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+0020 SPACE,
+ U+003E GREATER-THAN SIGN (>), or U+002F SOLIDUS (/), unless that string is
+ part of an <a href="#escaping" title=syntax-escape>escaping text span</a>.
<p>An <dfn id=escaping title=syntax-escape>escaping text span</dfn> is a
span of <a href="#text2" title=syntax-text>text</a> that starts with an <a
@@ -42420,15 +42415,15 @@
<dt>A sequence of bytes starting with: 0x3C, 0x4D or 0x6D, 0x45 or
0x65, 0x54 or 0x74, 0x41 or 0x61, and finally one of 0x09, 0x0A,
- 0x0B, 0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '<meta'
- followed by a space or slash)
+ 0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '<meta' followed by
+ a space or slash)
<dd>
<ol>
<li>
<p>Advance the <var title="">position</var> pointer so that it
- points at the next 0x09, 0x0A, 0x0B, 0x0C, 0x0D, 0x20, or 0x2F
- byte (the one in sequence of characters matched above).
+ points at the next 0x09, 0x0A, 0x0C, 0x0D, 0x20, or 0x2F byte (the
+ one in sequence of characters matched above).
<li>
<p><a href="#get-an" title=concept-get-attributes-when-sniffing>Get
@@ -42478,9 +42473,9 @@
<ol>
<li>
<p>Advance the <var title="">position</var> pointer so that it
- points at the next 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
- VT), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E
- (ASCII '>') byte.
+ points at the next 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+ FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
+ byte.
<li>
<p>Repeatedly <a href="#get-an"
@@ -42518,9 +42513,9 @@
<ol>
<li>
<p>If the byte at <var title="">position</var> is one of 0x09 (ASCII
- TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII FF), 0x0D (ASCII
- CR), 0x20 (ASCII space), or 0x2F (ASCII '/') then advance <var
- title="">position</var> to the next byte and redo this substep.
+ TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII
+ space), or 0x2F (ASCII '/') then advance <var title="">position</var>
+ to the next byte and redo this substep.
<li>
<p>If the byte at <var title="">position</var> is 0x3E (ASCII '>'),
@@ -42542,8 +42537,8 @@
<dd>Advance <var title="">position</var> to the next byte and jump to
the step below labeled <em>value</em>.
- <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C
- (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space)
+ <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D
+ (ASCII CR), or 0x20 (ASCII space)
<dd>Jump to the step below labeled <em>spaces</em>.
@@ -42575,9 +42570,9 @@
<li>
<p><em>Spaces.</em> If the byte at <var title="">position</var> is one
- of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII
- FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then advance <var
- title="">position</var> to the next byte, then, repeat this step.
+ of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII
+ CR), or 0x20 (ASCII space) then advance <var title="">position</var>
+ to the next byte, then, repeat this step.
<li>
<p>If the byte at <var title="">position</var> is <em>not</em> 0x3D
@@ -42590,9 +42585,9 @@
<li>
<p><em>Value.</em> If the byte at <var title="">position</var> is one
- of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII
- FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then advance <var
- title="">position</var> to the next byte, then, repeat this step.
+ of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII
+ CR), or 0x20 (ASCII space) then advance <var title="">position</var>
+ to the next byte, then, repeat this step.
<li>
<p>Process the byte at <var title="">position</var> as follows:</p>
@@ -42653,8 +42648,8 @@
<p>Process the byte at <var title="">position</var> as follows:</p>
<dl class=switch>
- <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C
- (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
+ <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D
+ (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
<dd>Abort the "get an attribute" algorithm. The attribute's name is
the value of <var title="">attribute name</var> and its value is the
@@ -43686,8 +43681,6 @@
<li>U+000A LINE FEED (LF)
- <li>U+000B LINE TABULATION
-
<li>U+000C FORM FEED (FF)</li>
<!--<li>U+000D CARRIAGE RETURN (CR)</li>-->
@@ -43751,8 +43744,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43799,8 +43790,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43858,8 +43847,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43924,8 +43911,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43978,8 +43963,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44092,8 +44075,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44156,8 +44137,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44412,8 +44391,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44437,8 +44414,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44475,8 +44450,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44512,8 +44485,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44561,8 +44532,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44675,8 +44644,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44726,8 +44693,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44840,8 +44805,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44927,8 +44890,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -45706,7 +45667,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -46024,7 +45985,7 @@
the comment token.</p>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -46091,7 +46052,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -46161,7 +46122,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -46438,7 +46399,7 @@
title="insertion mode: in head">in head</a>".</p>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A comment token
@@ -46479,7 +46440,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -47714,7 +47675,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -47962,7 +47923,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48659,7 +48620,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48723,7 +48684,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48819,7 +48780,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48892,7 +48853,7 @@
<dt>A DOCTYPE token
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A start tag whose tag name is "html"
@@ -48934,7 +48895,7 @@
<dt>A DOCTYPE token
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A start tag whose tag name is "html"
Received on Tuesday, 10 June 2008 21:33:16 UTC