
I18N-ISSUE-502: Unicode equivalent type all not clear ⓟ [find-text]

From: Internationalization Working Group Issue Tracker <sysbot+tracker@w3.org>
Date: Fri, 16 Oct 2015 21:50:28 +0000
To: public-i18n-core@w3.org
Message-Id: <E1ZnCtQ-0006Rg-6V@deneb.w3.org>

http://www.w3.org/International/track/issues/502

Raised by: Addison Phillips
On product: find-text

http://www.w3.org/TR/2015/WD-findtext-20151015/#idl-def-UnicodeEquivalenceType.canonical

The specification defines a Unicode equivalence type 'all' whose usage is unclear. Here's the description:

--
The search MUST use either canonical (NFC/NFD) or compatibility (NFKC/NFKD) Unicode equivalence of characters, whichever produces the more liberal match. 
--

The intention appears to be that all four normalization forms (NFC, NFD, NFKC, NFKD) must be applied to determine which produces the "most liberal match". Combined with the other options (case folding, distance), this is likely to be extremely slow and possibly not useful.

Note that one of the compatibility normalization forms is likely to be the most promiscuous match in all cases.
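
To illustrate the point above, here is a minimal sketch using Python's standard unicodedata module. The helper name `equivalent` and the sample strings are illustrative assumptions and do not come from the find-text draft. It compares pairs of strings under each of the four normalization forms. Canonically equivalent strings are also compatibility equivalent, so a pair that matches under NFC/NFD also matches under NFKC/NFKD, and trying all four forms adds nothing beyond the compatibility comparison.

```python
import unicodedata

FORMS = ("NFC", "NFD", "NFKC", "NFKD")

def equivalent(a: str, b: str, form: str) -> bool:
    """Hypothetical helper: True if a and b are identical after normalizing to `form`."""
    return unicodedata.normalize(form, a) == unicodedata.normalize(form, b)

# Sample pairs chosen for illustration only.
pairs = {
    # Precomposed e-acute vs. "e" followed by a combining acute accent.
    "canonical": ("\u00e9", "e\u0301"),
    # The "fi" ligature (U+FB01) vs. the two letters "f" and "i".
    "compatibility": ("\ufb01", "fi"),
    # Two strings that are not equivalent under any form.
    "unrelated": ("a", "b"),
}

# For each pair, record whether it matches under each normalization form.
results = {
    name: {form: equivalent(a, b, form) for form in FORMS}
    for name, (a, b) in pairs.items()
}

for name, row in results.items():
    print(name, row)
```

In this sketch the canonical pair matches under all four forms, while the ligature pair matches only under NFKC and NFKD. NFC and NFD agree with each other, as do NFKC and NFKD, so there are only two distinct equivalence relations to choose between, and the compatibility one is always at least as liberal.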
Received on Friday, 16 October 2015 21:50:34 UTC
