- From: poot <cvsmail@w3.org>
- Date: Tue, 20 Oct 2009 10:20:18 +0900 (JST)
- To: public-html-diffs@w3.org
hixie: Make more consistent use of '..' vs 'to' for ranges of Unicode characters and bytes. (whatwg r4187) http://dev.w3.org/cvsweb/html5/spec/Overview.html?r1=1.3333&r2=1.3334&f=h http://html5.org/tools/web-apps-tracker?from=4186&to=4187 =================================================================== RCS file: /sources/public/html5/spec/Overview.html,v retrieving revision 1.3333 retrieving revision 1.3334 diff -u -d -r1.3333 -r1.3334 --- Overview.html 20 Oct 2009 01:15:11 -0000 1.3333 +++ Overview.html 20 Oct 2009 01:20:03 -0000 1.3334 @@ -2162,24 +2162,24 @@ </div><h3 id="case-sensitivity-and-string-comparison"><span class="secno">2.3 </span>Case-sensitivity and string comparison</h3><p class="XXX annotation"><b>Status: </b><i>Implemented and widely deployed</i><p>Comparing two strings in a <dfn id="case-sensitive">case-sensitive</dfn> manner means comparing them exactly, code point for code point.<p>Comparing two strings in an <dfn id="ascii-case-insensitive">ASCII case-insensitive</dfn> manner means comparing them exactly, code point for code point, except - that the characters in the range U+0041 .. U+005A (i.e. LATIN + that the characters in the range U+0041 to U+005A (i.e. LATIN CAPITAL LETTER A to LATIN CAPITAL LETTER Z) and the corresponding - characters in the range U+0061 .. U+007A (i.e. LATIN SMALL LETTER A + characters in the range U+0061 to U+007A (i.e. LATIN SMALL LETTER A to LATIN SMALL LETTER Z) are considered to also match.<p>Comparing two strings in a <dfn id="compatibility-caseless">compatibility caseless</dfn> manner means using the Unicode <i>compatibility caseless match</i> operation to compare the two strings. <a href="#refsUNICODE">[UNICODE]</a><div class="impl"> <p><dfn id="converted-to-ascii-uppercase" title="converted to ASCII uppercase">Converting a string to ASCII uppercase</dfn> means replacing all characters in the range - U+0061 .. U+007A (i.e. LATIN SMALL LETTER A to LATIN SMALL LETTER Z) - with the corresponding characters in the range U+0041 .. U+005A + U+0061 to U+007A (i.e. LATIN SMALL LETTER A to LATIN SMALL LETTER Z) + with the corresponding characters in the range U+0041 to U+005A (i.e. LATIN CAPITAL LETTER A to LATIN CAPITAL LETTER Z).</p> <p><dfn id="converted-to-ascii-lowercase" title="converted to ASCII lowercase">Converting a string to ASCII lowercase</dfn> means replacing all characters in the range - U+0041 .. U+005A (i.e. LATIN CAPITAL LETTER A to LATIN CAPITAL + U+0041 to U+005A (i.e. LATIN CAPITAL LETTER A to LATIN CAPITAL LETTER Z) with the corresponding characters in the range U+0061 - .. U+007A (i.e. LATIN SMALL LETTER A to LATIN SMALL LETTER Z).</p> + to U+007A (i.e. LATIN SMALL LETTER A to LATIN SMALL LETTER Z).</p> </div><p>A string <var title="">pattern</var> is a <dfn id="prefix-match">prefix match</dfn> for a string <var title="">s</var> when <var title="">pattern</var> @@ -2909,7 +2909,7 @@ <!-- <dt>A U+002E FULL STOP character</dt> --> - <dt>A character in the range U+0001 .. U+001F, <!-- space --> U+0021 .. U+002B, <!-- comma --> U+002D .. U+002F, <!-- digits --> U+003A, <!-- semicolon --> U+003C .. U+0040, <!-- a-z --> U+005B .. U+0060, <!-- A-Z --> U+007b .. U+007F + <dt>A character in the range U+0001 to U+001F, <!-- space --> U+0021 to U+002B, <!-- comma --> U+002D to U+002F, <!-- digits --> U+003A, <!-- semicolon --> U+003C to U+0040, <!-- a-z --> U+005B to U+0060, <!-- A-Z --> U+007b to U+007F (i.e. any other non-alphabetic ASCII character)</dt> <!-- @@ -7271,7 +7271,7 @@ </div><h5 id="embedding-custom-non-visible-data"><span class="secno">3.2.3.8 </span><dfn>Embedding custom non-visible data</dfn></h5><p class="XXX annotation"><b>Status: </b><i>Last call for comments</i><p>A <dfn id="custom-data-attribute">custom data attribute</dfn> is an attribute in no namespace whose name starts with the string "<dfn id="attr-data" title="attr-data-*"><code>data-</code></dfn>", has at least one character after the hyphen, is <a href="#xml-compatible">XML-compatible</a>, and - contains no characters in the range U+0041 .. U+005A (LATIN CAPITAL + contains no characters in the range U+0041 to U+005A (LATIN CAPITAL LETTER A to LATIN CAPITAL LETTER Z).<p class="note">All attributes in <a href="#html-documents">HTML documents</a> get lowercased automatically, so the restriction on uppercase letters doesn't affect such documents.<p><a href="#custom-data-attribute" title="custom data attribute">Custom data attributes</a> @@ -18029,7 +18029,7 @@ </div><p>Any (namespace-less) attribute may be specified on the <code><a href="#the-embed-element">embed</a></code> element, so long as its name is <a href="#xml-compatible">XML-compatible</a> and contains no characters in the range - U+0041 .. U+005A (LATIN CAPITAL LETTER A to LATIN CAPITAL LETTER + U+0041 to U+005A (LATIN CAPITAL LETTER A to LATIN CAPITAL LETTER Z). These attributes are then passed as parameters to the <a href="#plugin">plugin</a>.<p class="note">All attributes in <a href="#html-documents">HTML documents</a> get lowercased automatically, so the restriction on uppercase letters @@ -35084,8 +35084,8 @@ following subsubsteps:</p> <ol><!-- * - . _ 0-9 a-z A-Z --><li><p>If the character isn't in the range U+0020, U+002A, - U+002D, U+002E, U+0030 .. U+0039, U+0041 .. U+005A, U+005F, - U+0061 .. U+007A then replace the character with a string + U+002D, U+002E, U+0030 to U+0039, U+0041 to U+005A, U+005F, + U+0061 to U+007A then replace the character with a string formed as follows: Start with the empty string, and then, taking each byte of the character when expressed in the selected character encoding in turn, append to the string a @@ -53315,8 +53315,8 @@ </div><p>Many strings in the HTML syntax (e.g. the names of elements and their attributes) are case-insensitive, but only for characters in - the ranges U+0041 .. U+005A (LATIN CAPITAL LETTER A to LATIN CAPITAL - LETTER Z) and U+0061 .. U+007A (LATIN SMALL LETTER A to LATIN SMALL + the ranges U+0041 to U+005A (LATIN CAPITAL LETTER A to LATIN CAPITAL + LETTER Z) and U+0061 to U+007A (LATIN SMALL LETTER A to LATIN SMALL LETTER Z). For convenience, in this section this is just referred to as "case-insensitive".<h4 id="the-doctype"><span class="secno">9.1.1 </span>The DOCTYPE</h4><p class="XXX annotation"><span><a href="http://www.w3.org/html/wg/tracker/issues/4">ISSUE-4</a> (html-versioning) blocks progress to Last Call</span><p>A <dfn id="syntax-doctype" title="syntax-doctype">DOCTYPE</dfn> is a mostly useless, but required, header.<p class="note">DOCTYPEs are required for legacy reasons. When
Received on Tuesday, 20 October 2009 01:20:48 UTC