[Bug 12100] UAs do not actually convert DOMStrings to sequences of Unicode characters. Test case: data:text/html,<!doctype html><script>document.documentElement.title = "\ud800"; alert(document.documentElement.title.charCodeAt(0));</script> Expected 65533, got 5529

From: <bugzilla@jessica.w3.org>
Date: Thu, 05 May 2011 06:34:09 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1QHs8X-0003TE-0o@jessica.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=12100

--- Comment #7 from Ian 'Hixie' Hickson <ian@hixie.ch> 2011-05-05 06:34:08 UTC ---
(In reply to comment #6)
> 
> Are there, in implementation reality, any APIs that don't operate with UTF-16
> code units? At least in Firefox, giving an unpaired surrogate to window.alert()
> doesn't throw. It shows an alert with a box that shows the hex for the
> surrogate.

That's the kind of bug I think we should be trying to fix here. I'll be the
first to say it shouldn't be a high priority. But it's the kind of thing that
can be fixed as people go through codebases, little fix here, little fix there,
until eventually it's all Unicode clean (except the DOM, which is a lost cause,
sadly).
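
For illustration only, here is a minimal sketch of what "Unicode clean" would mean for a single string, assuming the rule is that each unpaired surrogate code unit becomes U+FFFD (which is what the 65533 expected by the test case in the subject implies). The helper name replaceLoneSurrogates is hypothetical; it is not from this bug, the spec text, or any browser codebase.

```javascript
// Hypothetical helper: returns a copy of `s` in which every unpaired
// surrogate code unit is replaced with U+FFFD REPLACEMENT CHARACTER.
// Well-formed surrogate pairs are copied through unchanged.
function replaceLoneSurrogates(s) {
  let out = "";
  for (let i = 0; i < s.length; i++) {
    const c = s.charCodeAt(i);
    if (c >= 0xD800 && c <= 0xDBFF) {
      // High surrogate: valid only if a low surrogate follows immediately.
      const next = i + 1 < s.length ? s.charCodeAt(i + 1) : 0;
      if (next >= 0xDC00 && next <= 0xDFFF) {
        out += s[i] + s[i + 1];
        i++; // skip the low half of the pair
      } else {
        out += "\uFFFD";
      }
    } else if (c >= 0xDC00 && c <= 0xDFFF) {
      // Low surrogate with no preceding high surrogate.
      out += "\uFFFD";
    } else {
      out += s[i];
    }
  }
  return out;
}

// The raw string from the test case keeps the surrogate as-is:
//   "\ud800".charCodeAt(0)                        is 55296
// After the conversion sketched above:
//   replaceLoneSurrogates("\ud800").charCodeAt(0) is 65533
```

Applied at an API boundary such as window.alert(), a conversion along these lines would hand the API U+FFFD rather than the raw code unit 0xD800.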

Received on Thursday, 5 May 2011 06:34:10 UTC
