[Bug 7935] [F&O] normalize-unicode on codepoints that are not characters. from bugzilla@wiggum.w3.org on 2009-10-16 (public-qt-comments@w3.org from October 2009)

From: <bugzilla@wiggum.w3.org>
Date: Fri, 16 Oct 2009 17:34:02 +0000
To: public-qt-comments@w3.org
Message-Id: <E1Myqgk-00082f-My@wiggum.w3.org>

http://www.w3.org/Bugs/Public/show_bug.cgi?id=7935

--- Comment #1 from Michael Kay <mike@saxonica.com>  2009-10-16 17:34:02 ---
I would think that normalizing a codepoint that is not assigned to any
character should simply leave that codepoint unchanged in the result.

I'm pretty sure this is what the reference implementation from the Unicode
Consortium does.
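
As a rough check against one widely available library, here is a minimal
sketch using Python's standard unicodedata module. This is not the Unicode
Consortium's reference implementation, and the sample codepoints (two
noncharacters and one private-use codepoint) and the helper name are chosen
purely for illustration.

```python
import unicodedata

# Illustrative sample only: U+FDD0 and U+FFFF are noncharacters,
# U+E000 is a private-use codepoint.
SAMPLE = ["\ufdd0", "\uffff", "\ue000"]
FORMS = ["NFC", "NFD", "NFKC", "NFKD"]


def unchanged_by_normalization(text):
    """Return True if every normalization form leaves text as it is."""
    return all(unicodedata.normalize(form, text) == text for form in FORMS)


# Map each sample codepoint to whether normalization left it alone.
results = {f"U+{ord(ch):04X}": unchanged_by_normalization(ch) for ch in SAMPLE}

# Ordinary characters around a noncharacter are still normalized:
# "e" + COMBINING ACUTE ACCENT composes, U+FDD0 passes through.
mixed = unicodedata.normalize("NFC", "e\u0301\ufdd0")

print(results)
print([f"U+{ord(ch):04X}" for ch in mixed])
```

With this library, the sample codepoints pass through all four normalization
forms untouched, while the neighbouring characters are still composed.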

It's always a problem, of course, when you want to reuse library code that has
made a different decision, but I think that, as with regexes, we should avoid
letting that implementation concern influence our spec.

And I don't think we should be making value judgements that certain codepoints
or characters are bad. Some of our users might think it good that there are
codepoints they can use as they like. For better or worse we've chosen to make
them legal, and that's good enough.



Received on Friday, 16 October 2009 17:34:04 UTC