Re: Unicode Normalization thread should slow down; summary needed

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Wed, 11 Feb 2009 12:01:21 +0100
To: Henri Sivonen <hsivonen@iki.fi>
Cc: public-i18n-core@w3.org, W3C Style List <www-style@w3.org>
Message-ID: <pd65p49k8n7dlb2miq4nfqjt1lpn1hap0d@hive.bjoern.hoehrmann.de>

* Henri Sivonen wrote:
>What was the thrust of the i18n core comments then? Except for your  
>remarks, as far as I can tell, the thread has revolved around keyboard  
>input order or differences in input methods between operating systems  
>causing different code point sequences for the same visual appearances.

I would think the I18N Core Working Group is well aware that there are
too many ways in which strings end up in web content to count them all,
and I consequently took the discussion about direct character-by-character
user input, as well as that about identifiers, as mere illustration of
the general problem. How big that problem is, or what requirements a
solution would have to meet, nobody seems to know.

As far as CSS Selectors go, they largely state that strings are to be
compared using some collation defined by the language of the document.
When Selectors are applied to an XML document, if the XML specification
says NFC(Björn) and NFD(Björn) are different IDs, then there is no basis
for Selectors to treat them as the same. As such, this discussion is
largely misplaced on www-style.
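
To make the NFC/NFD point concrete, here is a minimal Python sketch. It
assumes that the document language compares IDs code point by code point
without normalizing them first; the helper names ids_match and
ids_match_normalized are made up for this illustration and do not come
from any specification.

```python
import unicodedata

# "Björn" written with an escape so the source encoding cannot change the form.
name = "Bj\u00f6rn"

nfc = unicodedata.normalize("NFC", name)  # precomposed: o-with-diaeresis is one code point
nfd = unicodedata.normalize("NFD", name)  # decomposed: "o" followed by U+0308


def ids_match(a: str, b: str) -> bool:
    """Hypothetical ID comparison: plain code point equality, no normalization."""
    return a == b


def ids_match_normalized(a: str, b: str) -> bool:
    """Hypothetical alternative: normalize both sides to NFC before comparing."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


if __name__ == "__main__":
    print([hex(ord(c)) for c in nfc])
    print([hex(ord(c)) for c in nfd])
    print(ids_match(nfc, nfd))
    print(ids_match_normalized(nfc, nfd))
```

Under the first comparison the two spellings are different IDs, so a
selector written in one form has nothing to match in a document that uses
the other; only the second, normalizing comparison treats them as equal.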
-- 
Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 
Received on Wednesday, 11 February 2009 11:02:00 GMT
