html5/spec Overview.html,1.4595,1.4596

From: Ian Hickson via cvs-syncmail <cvsmail@w3.org>
Date: Sat, 01 Jan 2011 05:46:25 +0000
To: public-html-commits@w3.org
Message-Id: <E1PYuIM-0007YX-3U@lionel-hutz.w3.org>
Update of /sources/public/html5/spec
In directory hutz:/tmp/cvs-serv29028

Modified Files:
	Overview.html 
Log Message:
Define 'overlong form'. (whatwg r5733)

Index: Overview.html
===================================================================
RCS file: /sources/public/html5/spec/Overview.html,v
retrieving revision 1.4595
retrieving revision 1.4596
diff -u -d -r1.4595 -r1.4596
--- Overview.html	1 Jan 2011 05:28:42 -0000	1.4595
+++ Overview.html	1 Jan 2011 05:46:22 -0000	1.4596
@@ -3192,7 +3192,7 @@
 
   <dl class="switch"><dt>One byte in the range FE to FF</dt>
 
-   <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+   <dt><a href="#overlong-form" title="overlong form">Overlong forms</a> (e.g. F0 80 80 A0)</dt>
 
    <dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
 
@@ -3226,7 +3226,10 @@
 
    <dd>Each byte must be replaced with a U+FFFD REPLACEMENT CHARACTER.</dd>
 
-  </dl><p class="example">For example, the byte string "41 98 BA 42 E2 98
+  </dl><p>For the purposes of the above requirements, an <dfn id="overlong-form">overlong
+  form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+  more bytes than the minimum needed to encode that codepoint in
+  UTF-8.<p class="example">For example, the byte string "41 98 BA 42 E2 98
   43 E2 98 BA E2 98" would be converted to the string
   "A&#65533;&#65533;B&#65533;C&#9786;&#65533;".<h3 id="common-microsyntaxes"><span class="secno">2.5 </span>Common microsyntaxes</h3><p>There are various places in HTML that accept particular data
   types, such as dates or numbers. This section describes what the
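
The definition added in this revision (a sequence that uses more bytes than the minimum needed for its code point) can be illustrated with a minimal Python sketch. The helper names `min_utf8_length`, `decode_raw`, and `is_overlong` are hypothetical and are not part of the spec or of this commit. The sketch assumes the input is a single, structurally well-formed sequence (one lead byte followed by the right number of 80..BF continuation bytes) and does not cover the other error cases listed in the spec's switch.

```python
def min_utf8_length(codepoint: int) -> int:
    """Return the minimum number of bytes UTF-8 needs for a code point."""
    if codepoint < 0x80:
        return 1
    if codepoint < 0x800:
        return 2
    if codepoint < 0x10000:
        return 3
    return 4


def decode_raw(seq: bytes) -> int:
    """Extract the code point bits from one UTF-8-shaped sequence.

    Unlike a conforming decoder, this deliberately does NOT reject
    overlong forms, so the caller can inspect them.
    """
    if not seq:
        raise ValueError("empty sequence")
    lead = seq[0]
    if lead < 0x80:
        length, cp = 1, lead
    elif 0xC0 <= lead <= 0xDF:
        length, cp = 2, lead & 0x1F
    elif 0xE0 <= lead <= 0xEF:
        length, cp = 3, lead & 0x0F
    elif 0xF0 <= lead <= 0xF7:
        length, cp = 4, lead & 0x07
    else:
        raise ValueError("not a valid lead byte")
    if len(seq) != length:
        raise ValueError("wrong number of bytes for this lead byte")
    for b in seq[1:]:
        if b & 0xC0 != 0x80:
            raise ValueError("expected a continuation byte in 80..BF")
        cp = (cp << 6) | (b & 0x3F)
    return cp


def is_overlong(seq: bytes) -> bool:
    """True if seq uses more bytes than the minimum for its code point."""
    return len(seq) > min_utf8_length(decode_raw(seq))


# The spec's own example: F0 80 80 A0 spends four bytes on U+0020.
print(is_overlong(bytes([0xF0, 0x80, 0x80, 0xA0])))  # True
# E2 98 BA is the shortest encoding of U+263A, so it is not overlong.
print(is_overlong(bytes([0xE2, 0x98, 0xBA])))        # False
```

Under this definition, any two-byte sequence with a lead byte of C0 or C1 is also overlong, because it can only encode a code point below U+0080.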
Received on Saturday, 1 January 2011 05:46:27 GMT
