W3C home > Mailing lists > Public > www-international@w3.org > January to March 2014

[Bug 24104] Clarify how encoders should deal with lone surrogates

From: <bugzilla@jessica.w3.org>
Date: Fri, 28 Mar 2014 15:58:56 +0000
To: www-international@w3.org
Message-ID: <bug-24104-4285-Tq0EohVZ3k@http.www.w3.org/Bugs/Public/>

--- Comment #6 from Anne <annevk@annevk.nl> ---
Generally. But it affects form submission and URLs of course.

It seems Unicode has the contract as a mapping of Unicode scalar values (code
points minus surrogates) to bytes and vice versa. That seems reasonable to me
but does mean that everyone using encoders/decoders has to convert their code
point sequence to a Unicode scalar value sequence first.

You are receiving this mail because:
You are on the CC list for the bug.
Received on Friday, 28 March 2014 15:58:58 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 22:41:04 UTC