hixie: Define 'overlong form'. (whatwg r5733)

http://dev.w3.org/cvsweb/html5/spec/Overview.html?r1=1.4595&r2=1.4596&f=h
http://html5.org/tools/web-apps-tracker?from=5732&to=5733

===================================================================
RCS file: /sources/public/html5/spec/Overview.html,v
retrieving revision 1.4595
retrieving revision 1.4596
diff -u -d -r1.4595 -r1.4596
--- Overview.html 1 Jan 2011 05:28:42 -0000 1.4595
+++ Overview.html 1 Jan 2011 05:46:22 -0000 1.4596
@@ -3192,7 +3192,7 @@
 
   <dl class="switch"><dt>One byte in the range FE to FF</dt>
 
-   <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+   <dt><a href="#overlong-form" title="overlong form">Overlong forms</a> (e.g. F0 80 80 A0)</dt>
 
    <dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
 
@@ -3226,7 +3226,10 @@
 
    <dd>Each byte must be replaced with a U+FFFD REPLACEMENT CHARACTER.</dd>
 
-  </dl><p class="example">For example, the byte string "41 98 BA 42 E2 98
+  </dl><p>For the purposes of the above requirements, an <dfn id="overlong-form">overlong
+  form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+  more bytes than the minimum needed to encode that codepoint in
+  UTF-8.<p class="example">For example, the byte string "41 98 BA 42 E2 98
   43 E2 98 BA E2 98" would be converted to the string
   "A&#65533;&#65533;B&#65533;C&#9786;&#65533;".<h3 id="common-microsyntaxes"><span class="secno">2.5 </span>Common microsyntaxes</h3><p>There are various places in HTML that accept particular data
   types, such as dates or numbers. This section describes what the
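
The definition added in this revision can be illustrated with a short check. The following is only a sketch, not part of the spec or the commit: the function names (min_utf8_length, raw_code_point, is_overlong) are hypothetical, and the code assumes its input is a single structurally complete UTF-8 sequence of one to four bytes. It extracts the payload bits without validating them, then compares the sequence length against the minimum length UTF-8 needs for that code point.

```python
def min_utf8_length(code_point: int) -> int:
    """Return the minimum number of bytes UTF-8 needs for this code point."""
    if code_point < 0x80:
        return 1
    if code_point < 0x800:
        return 2
    if code_point < 0x10000:
        return 3
    return 4


def raw_code_point(seq: bytes) -> int:
    """Extract the payload bits of one 1-4 byte sequence.

    Deliberately does NOT reject overlong forms; it only checks that the
    lead byte and continuation bytes have the expected bit patterns.
    """
    if not seq:
        raise ValueError("empty sequence")
    lead = seq[0]
    if lead < 0x80:
        length, code_point = 1, lead
    elif 0xC0 <= lead <= 0xDF:
        length, code_point = 2, lead & 0x1F
    elif 0xE0 <= lead <= 0xEF:
        length, code_point = 3, lead & 0x0F
    elif 0xF0 <= lead <= 0xF7:
        length, code_point = 4, lead & 0x07
    else:
        raise ValueError("not a lead byte handled by this sketch")
    if len(seq) != length:
        raise ValueError("sequence length does not match lead byte")
    for byte in seq[1:]:
        if byte & 0xC0 != 0x80:
            raise ValueError("expected a continuation byte (80 to BF)")
        code_point = (code_point << 6) | (byte & 0x3F)
    return code_point


def is_overlong(seq: bytes) -> bool:
    """True if seq uses more bytes than the minimum for its code point."""
    return len(seq) > min_utf8_length(raw_code_point(seq))


# The spec's own example: F0 80 80 A0 carries the payload U+0020,
# which needs only one byte, so the four-byte form is overlong.
print(is_overlong(bytes.fromhex("F0 80 80 A0")))
# E2 98 BA is the shortest encoding of U+263A, so it is not overlong.
print(is_overlong(bytes.fromhex("E2 98 BA")))
```

Under this check, any sequence starting with C0 or C1 also comes out as overlong, which is consistent with the "C0 to C1" entry in the list above.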

Received on Wednesday, 12 January 2011 02:44:27 UTC