W3C home > Mailing lists > Public > www-style@w3.org > February 2009

Re: [CSS21][css3-namespace][css3-page][css3-selectors][css3-content] Unicode Normalization

From: Andrew Cunningham <andrewc@vicnet.net.au>
Date: Wed, 04 Feb 2009 09:57:01 +1100
Message-ID: <4988CBBD.2020001@vicnet.net.au>
To: Henri Sivonen <hsivonen@iki.fi>
CC: Boris Zbarsky <bzbarsky@MIT.EDU>, public-i18n-core@w3.org, W3C Style List <www-style@w3.org>
Hi Henri,

Henri Sivonen wrote:

> Surely a MARC21-based system should then use this data representation 
> internally but not leak it to a Web UI for the system.
I agree whole heartedly

But the reality is very fewILMSs normalise

But then that not really any different form the average website, content 
management system, web application, web service or editing environment.

Current industry practice as a rule is not to normalise. I could 
speculate that this is because most developers have little understanding 
or knowledge of working with lesser used languages, and do not know how 
to handle languages these languages.

Standard European language keyboards always generate NFC data. You have 
to deliberately go out of oyur way to generate NFD for European 
languages. I suspect the average developer has never seen European 
language data in NFD format. They don't have to deal with it, they don't 
think about it, they don't build it into their web apps. So these tools 
then become problematic for people using lesser used languages.

although if i was using a risk analysis scale, I'd rate normalisation of 
selectors as a low impact issue, since it wouldn't occur frequently.

Poor design generally tends to be the biggest barrier for lesser used 

Andrew Cunningham
Senior Manager, Research and Development
State Library of Victoria
328 Swanston Street
Melbourne VIC 3000

Ph: +61-3-8664-7430
Fax: +61-3-9639-2175

Email: andrewc@vicnet.net.au
Alt email: lang.support@gmail.com


Received on Tuesday, 3 February 2009 22:58:26 UTC

This archive was generated by hypermail 2.3.1 : Monday, 2 May 2016 14:38:23 UTC