[Bug 14526] WF2: When adding filenames to the data set, should there be normalization of decomposed forms?

From: <bugzilla@jessica.w3.org>
Date: Wed, 02 Nov 2011 20:43:19 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1RLheZ-0002Pq-Rd@jessica.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=14526

Masatoshi Kimura <VYV03354@nifty.ne.jp> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |VYV03354@nifty.ne.jp

--- Comment #15 from Masatoshi Kimura <VYV03354@nifty.ne.jp> 2011-11-02 20:43:18 UTC ---
Mac OS uses a special variant of NFD that avoids normalizing CJK Compatibility
Ideographs, because some Compatibility Ideographs are important (even
required) in Japan. Roughly speaking, it excludes specific ranges of code
points from normalization.
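
To illustrate why this matters, here is a minimal Python sketch using only the
standard unicodedata module (nothing Mac OS specific). It shows that standard
NFD and NFC both replace a CJK Compatibility Ideograph with its unified
counterpart. U+F900 is picked only as a sample code point, and the helper name
survives_normalization is hypothetical.

```python
import unicodedata

def survives_normalization(ch, form):
    """Return True if `ch` is unchanged by the given normalization form."""
    return unicodedata.normalize(form, ch) == ch

sample = "\uf900"  # CJK COMPATIBILITY IDEOGRAPH-F900, chosen as an example

for form in ("NFD", "NFC"):
    normalized = unicodedata.normalize(form, sample)
    # Print the code points before and after normalization.
    print(form,
          "U+%04X" % ord(sample), "->",
          " ".join("U+%04X" % ord(c) for c in normalized),
          "(unchanged)" if survives_normalization(sample, form) else "(changed)")
```

Because the decomposition is a singleton, NFC does not restore the original
code point, so the distinction is lost under either form.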

I found a proposal document from Apple (which was rejected by the UTC).
http://www.unicode.org/review/resolved-pri.html#pri7
http://www.unicode.org/review/pr-7b.html
Note that this proposal differs slightly from what Mac OS actually uses:
Mac OS also excludes the code points from U+2000 to U+2FFF.
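
To make the idea of a range-excluding normalization concrete, here is a rough
Python sketch. It is not Apple's implementation; the names EXCLUDED_RANGES,
is_excluded and nfd_with_exclusions are hypothetical. The U+2000 to U+2FFF
range comes from the statement above. The other two ranges are simply the
Unicode blocks "CJK Compatibility Ideographs" and "CJK Compatibility
Ideographs Supplement", which I assume approximate what is excluded.
Normalizing each run separately also ignores combining-mark reordering across
run boundaries, so treat this as an approximation only.

```python
import unicodedata

# Assumed exclusion ranges (inclusive). Only the first one is stated in this
# comment; the other two are the Unicode compatibility-ideograph blocks and
# are an assumption about what a Mac OS-like variant would skip.
EXCLUDED_RANGES = (
    (0x2000, 0x2FFF),
    (0xF900, 0xFAFF),    # CJK Compatibility Ideographs block
    (0x2F800, 0x2FA1F),  # CJK Compatibility Ideographs Supplement block
)

def is_excluded(ch):
    """Return True if `ch` falls in one of the assumed exclusion ranges."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in EXCLUDED_RANGES)

def nfd_with_exclusions(text):
    """Apply NFD to runs of non-excluded characters; copy excluded ones as-is."""
    out = []
    run = []  # pending characters that are eligible for normalization
    for ch in text:
        if is_excluded(ch):
            if run:
                out.append(unicodedata.normalize("NFD", "".join(run)))
                run = []
            out.append(ch)  # excluded code point passes through untouched
        else:
            run.append(ch)
    if run:
        out.append(unicodedata.normalize("NFD", "".join(run)))
    return "".join(out)
```

For example, nfd_with_exclusions("\u00e9\uf900") decomposes the precomposed
e-acute into "e" plus U+0301 but leaves U+F900 intact, whereas plain NFD would
replace U+F900 as well.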

I think we should define a "willful violation of UAX #15", a "Web
Normalization", or something else other than plain NFC.

Received on Wednesday, 2 November 2011 20:43:25 GMT