html5/spec Overview.html,1.926,1.927

Update of /sources/public/html5/spec
In directory hutz:/tmp/cvs-serv13212

Modified Files:
	Overview.html 
Log Message:
Remove U+000B LINE TABULATION (VT) from the list of characters treated like a space character in HTML, for consistency with CSS. This is incompatible with IE7, but apparently IE has all kinds of complications here and in any case IE8 doesn't treat U+000B as a space character. (whatwg r1738)

Index: Overview.html
===================================================================
RCS file: /sources/public/html5/spec/Overview.html,v
retrieving revision 1.926
retrieving revision 1.927
diff -u -d -r1.926 -r1.927
--- Overview.html	10 Jun 2008 21:20:02 -0000	1.926
+++ Overview.html	10 Jun 2008 21:32:37 -0000	1.927
@@ -2453,13 +2453,7 @@
 
    <dd>
     <p>Implementations must support some version of the Media Queries
-     language. However, when applying the rules of the Media Queries
-     specification to media queries found in content attributes of <a
-     href="#html-elements">HTML elements</a>, user agents must act as if all
-     U+000B LINE TABULATION characters in the attribute were in fact U+0020
-     SPACE characters. This is required to provide a consistent processing of
-     <a href="#space" title="space character">space characters</a> in HTML.
-     <a href="#references">[MQ]</a></p>
+     language. <a href="#references">[MQ]</a></p>
   </dl>
 
   <p>This specification does not require support of any particular network
@@ -4570,8 +4564,8 @@
 
   <p>The <dfn id=space title="space character">space characters</dfn>, for
    the purposes of this specification, are U+0020 SPACE, U+0009 CHARACTER
-   TABULATION (tab), U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-   FORM FEED (FF), and U+000D CARRIAGE RETURN (CR).
+   TABULATION (tab), U+000A LINE FEED (LF), U+000C FORM FEED (FF), and U+000D
+   CARRIAGE RETURN (CR).
 
   <p>Some of the micro-parsers described below follow the pattern of having
    an <var title="">input</var> variable that holds the string being parsed,
@@ -8439,11 +8433,11 @@
 
   <p>An <dfn id=ascii-compatible>ASCII-compatible character encoding</dfn> is
    one that is a superset of US-ASCII (specifically, ANSI_X3.4-1968) for
-   bytes in the range 0x09 - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F,
-   0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any character
-  sets we want to support do things outside that range?  -->.
-   <!-- XXX
-  #refs RFC1345 ? -->
+   bytes in the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
+   0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
+  character sets we want to support do things outside that range?
+  -->.
+   <!-- XXX #refs RFC1345 ? -->
 
   <p>Authors should not use JIS_X0212-1990, x-JIS0208, and encodings based on
    EBCDIC. Authors should not use UTF-32. Authors must not use the CESU-8,
@@ -33490,14 +33484,16 @@
     <p>Otherwise, if any of the first <var title="">n</var> bytes of the
      resource are in one of the following byte ranges:</p>
     <!-- This byte list is based on RFC 2046 Section 4.1.2. Characters
-    in the range 0x00-0X1F, with the exception of 0x09 - 0x0D (ASCII
-    for TAB, LF, VT, FF, and CR), and character 0x1B (reportedly used
-    by some encodings as a shift escape), are invalid. Thus, if we see
-    them, we assume it's not text. -->
+    in the range 0x00-0x1F, with the exception of 0x09, 0x0A, 0x0C,
+    0x0D (ASCII for TAB, LF, FF, and CR), and character 0x1B
+    (reportedly used by some encodings as a shift escape), are
+    invalid. Thus, if we see them, we assume it's not text. -->
     
     <ul class=brief>
      <li> 0x00 - 0x08
 
+     <li> 0x0B
+
      <li> 0x0E - 0x1A
 
      <li> 0x1C - 0x1F
@@ -33597,10 +33593,10 @@
            whitespace to be skipped when sniffing for a type signature.</p>
 
           <p>If the <var title="">index<sub>stream</sub></var>th byte of the
-           stream is one of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
-           VT), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space), then
-           increment only the <var title="">index<sub>stream</sub></var> to
-           the next byte in the byte stream.</p>
+           stream is one of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+           FF), 0x0D (ASCII CR), or 0x20 (ASCII space), then increment only
+           the <var title="">index<sub>stream</sub></var> to the next byte in
+           the byte stream.</p>
 
           <p>Otherwise, increment only the <var
            title="">index<sub>pattern</sub></var> to the next byte in the
@@ -34063,16 +34059,16 @@
      such match is found, return nothing.</p>
 
    <li>
-    <p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters
-     that immediately follow the word 'charset' (there might not be any).
+    <p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020 characters that
+     immediately follow the word 'charset' (there might not be any).
 
    <li>
     <p>If the next character is not a U+003D EQUALS SIGN ('='), return
      nothing.
 
    <li>
-    <p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters
-     that immediately follow the equals sign (there might not be any).
+    <p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020 characters that
+     immediately follow the equals sign (there might not be any).
 
    <li>
     <p>Process the next character as follows:</p>
@@ -34101,7 +34097,7 @@
 
      <dd>
       <p>Return the string from this character to the first U+0009, U+000A,
-       U+000B, U+000C, U+000D, U+0020, or U+003B character or the end of <var
+       U+000C, U+000D, U+0020, or U+003B character or the end of <var
        title="">s</var>, whichever comes first.
     </dl>
   </ol>
@@ -42047,10 +42043,9 @@
    of the string "<code title="">&lt;/</code>" (U+003C LESS-THAN SIGN, U+002F
    SOLIDUS) followed by characters that case-insensitively<!--ASCII--> match
    the tag name of the element followed by one of U+0009 CHARACTER
-   TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
-   FEED (FF), U+0020 SPACE, U+003E GREATER-THAN SIGN (>), or U+002F SOLIDUS
-   (/), unless that string is part of an <a href="#escaping"
-   title=syntax-escape>escaping text span</a>.
+   TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+0020 SPACE,
+   U+003E GREATER-THAN SIGN (>), or U+002F SOLIDUS (/), unless that string is
+   part of an <a href="#escaping" title=syntax-escape>escaping text span</a>.
 
   <p>An <dfn id=escaping title=syntax-escape>escaping text span</dfn> is a
    span of <a href="#text2" title=syntax-text>text</a> that starts with an <a
@@ -42420,15 +42415,15 @@
 
        <dt>A sequence of bytes starting with: 0x3C, 0x4D or 0x6D, 0x45 or
         0x65, 0x54 or 0x74, 0x41 or 0x61, and finally one of 0x09, 0x0A,
-        0x0B, 0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '&lt;meta'
-        followed by a space or slash)
+        0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '&lt;meta' followed by
+        a space or slash)
 
        <dd>
         <ol>
          <li>
           <p>Advance the <var title="">position</var> pointer so that it
-           points at the next 0x09, 0x0A, 0x0B, 0x0C, 0x0D, 0x20, or 0x2F
-           byte (the one in sequence of characters matched above).
+           points at the next 0x09, 0x0A, 0x0C, 0x0D, 0x20, or 0x2F byte (the
+           one in sequence of characters matched above).
 
          <li>
           <p><a href="#get-an" title=concept-get-attributes-when-sniffing>Get
@@ -42478,9 +42473,9 @@
         <ol>
          <li>
           <p>Advance the <var title="">position</var> pointer so that it
-           points at the next 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
-           VT), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E
-           (ASCII '>') byte.
+           points at the next 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+           FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
+           byte.
 
          <li>
           <p>Repeatedly <a href="#get-an"
@@ -42518,9 +42513,9 @@
     <ol>
      <li>
       <p>If the byte at <var title="">position</var> is one of 0x09 (ASCII
-       TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII FF), 0x0D (ASCII
-       CR), 0x20 (ASCII space), or 0x2F (ASCII '/') then advance <var
-       title="">position</var> to the next byte and redo this substep.
+       TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII
+       space), or 0x2F (ASCII '/') then advance <var title="">position</var>
+       to the next byte and redo this substep.
 
      <li>
       <p>If the byte at <var title="">position</var> is 0x3E (ASCII '>'),
@@ -42542,8 +42537,8 @@
        <dd>Advance <var title="">position</var> to the next byte and jump to
         the step below labeled <em>value</em>.
 
-       <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C
-        (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space)
+       <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D
+        (ASCII CR), or 0x20 (ASCII space)
 
        <dd>Jump to the step below labeled <em>spaces</em>.
 
@@ -42575,9 +42570,9 @@
 
      <li>
       <p><em>Spaces.</em> If the byte at <var title="">position</var> is one
-       of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII
-       FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then advance <var
-       title="">position</var> to the next byte, then, repeat this step.
+       of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII
+       CR), or 0x20 (ASCII space) then advance <var title="">position</var>
+       to the next byte, then, repeat this step.
 
      <li>
       <p>If the byte at <var title="">position</var> is <em>not</em> 0x3D
@@ -42590,9 +42585,9 @@
 
      <li>
       <p><em>Value.</em> If the byte at <var title="">position</var> is one
-       of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII
-       FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then advance <var
-       title="">position</var> to the next byte, then, repeat this step.
+       of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII
+       CR), or 0x20 (ASCII space) then advance <var title="">position</var>
+       to the next byte, then, repeat this step.
 
      <li>
       <p>Process the byte at <var title="">position</var> as follows:</p>
@@ -42653,8 +42648,8 @@
       <p>Process the byte at <var title="">position</var> as follows:</p>
 
       <dl class=switch>
-       <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C
-        (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
+       <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D
+        (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
 
        <dd>Abort the "get an attribute" algorithm. The attribute's name is
         the value of <var title="">attribute name</var> and its value is the
@@ -43686,8 +43681,6 @@
 
      <li>U+000A LINE FEED (LF)
 
-     <li>U+000B LINE TABULATION
-
      <li>U+000C FORM FEED (FF)</li>
      <!--<li>U+000D CARRIAGE RETURN (CR)</li>-->
 
@@ -43751,8 +43744,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -43799,8 +43790,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -43858,8 +43847,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -43924,8 +43911,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -43978,8 +43963,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44092,8 +44075,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44156,8 +44137,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44412,8 +44391,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44437,8 +44414,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44475,8 +44450,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44512,8 +44485,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44561,8 +44532,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44675,8 +44644,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44726,8 +44693,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44840,8 +44805,6 @@
 
      <dt>U+000A LINE FEED (LF)
 
-     <dt>U+000B LINE TABULATION
-
      <dt>U+000C FORM FEED (FF)</dt>
      <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -44927,8 +44890,6 @@
 
    <dt>U+000A LINE FEED (LF)
 
-   <dt>U+000B LINE TABULATION
-
    <dt>U+000C FORM FEED (FF)</dt>
    <!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
 
@@ -45706,7 +45667,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -46024,7 +45985,7 @@
      the comment token.</p>
 
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -46091,7 +46052,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -46161,7 +46122,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -46438,7 +46399,7 @@
      title="insertion mode: in head">in head</a>".</p>
 
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dt>A comment token
@@ -46479,7 +46440,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -47714,7 +47675,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -47962,7 +47923,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -48659,7 +48620,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -48723,7 +48684,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -48819,7 +48780,7 @@
 
   <dl class=switch>
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
@@ -48892,7 +48853,7 @@
    <dt>A DOCTYPE token
 
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dt>A start tag whose tag name is "html"
@@ -48934,7 +48895,7 @@
    <dt>A DOCTYPE token
 
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
-    U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+    U+000A LINE FEED (LF), U+000C FORM FEED (FF),
     <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dt>A start tag whose tag name is "html"

Received on Tuesday, 10 June 2008 21:33:16 UTC