W3C home > Mailing lists > Public > www-rdf-comments@w3.org > July to September 2003

N-Triples Unicode Ambiguity

From: Sean B. Palmer <sean@mysterylights.com>
Date: Fri, 5 Sep 2003 02:36:13 +0100
Message-ID: <01cd01c3734e$183e8660$1054ff3e@z5n9x1>
To: "RDF Comments" <www-rdf-comments@w3.org>

Section 3.2 of rdf-testcases [1] states:

   \UHHHHHHHH
       8 required hexadecimal digits HHHHHHHH encoding
       character [#x10000-#x10FFFF]

Which implies that any code point over U+10FFFF cannot be represented
in an N-Triples string, unless encoded as a surrogate block. However,
the test.nt N-Triples test file [2] referenced from rdf-testcases
contains the following literal production instances:

   "\U001FFFFF" # resource18
   "\U03FFFFFF" # resource19
   "\U7FFFFFFF" # resource20

Each of which are greater than U+10FFFF. Is the rdf-testcases in
error, or test.nt, or neither?

[1] http://www.w3.org/TR/rdf-testcases/#ntrip_strings
- 3.2 Strings. W3C Working Draft 23 January 2003
[2] http://www.w3.org/2000/10/rdf-tests/rdfcore/ntriples/test.nt
- $Id: test.nt,v 1.6 2003/08/03 16:07:09 dbeckett2 Exp $

--
Sean B. Palmer, http://purl.org/net/sbp/
"phenomicity by the bucketful" - http://miscoranda.com/
Received on Thursday, 4 September 2003 21:39:24 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 21 September 2012 14:16:32 GMT