i18n-ISSUE-467: What are the rules for string equality when column names are matched with annotations from Steven Atkin on 2015-06-01 (public-csv-wg@w3.org from June 2015)

From: Steven Atkin <atkin@us.ibm.com>
Date: Mon, 1 Jun 2015 09:55:17 -0700
To: public-csv-wg@w3.org, www-international@w3.org
Message-ID: <OF109013C4.0250C5A5-ON88257E57.005CD446-88257E57.005CF3D7@us.ibm.com>

6.2 Example with single table and rich annotations
http://www.w3.org/TR/2015/WD-csv2json-20150416/#example-tree-ops-ext

When the names of the columns in the CSV data are compared with the names
of the columns in the annotations what is the rule for determining if they
are the same? For example, is equality based solely on the UTF-8 raw byte
sequence or is some form of Unicode Normalization applied first and does
case matter when making comparisons?

It is recommended that Unicode text not be normalized if it is already in a
Unicode encoding. If text needs to be converted into Unicode, then a
normalizing transcoder should be used and text be normalized into Unicode
Normal Form C.

It is recommended that case sensitive matching be used when making
comparisons.


Steven Atkin, Ph.D.
STSM - Chief Globalization Architect
IBM Globalization Center of Competency
atkin@us.ibm.com
http://www-3.ibm.com/software/globalization/index.jsp

Received on Monday, 1 June 2015 16:57:31 UTC