[Bug 9989] Is the number of replacement characters supposed to be well-defined? If not this should be explicitly noted. If it is then more detail is required.

http://www.w3.org/Bugs/Public/show_bug.cgi?id=9989


Simon Pieters <simonp@opera.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|NEEDSINFO                   |




--- Comment #2 from Simon Pieters <simonp@opera.com>  2010-07-22 13:25:19 ---
The spec says to replace bytes *or* sequences of bytes that are not valid utf-8
with U+FFFD. It is thus not well-defined how many U+FFFD are expected for any
given sequence of bytes that are not valid utf-8. It could be one or the same
amount of bytes that are not valid, or anything in between.

(The same applies to text/html parsing.)

-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

Received on Thursday, 22 July 2010 13:25:21 UTC