W3C home > Mailing lists > Public > www-rdf-comments@w3.org > July to September 2003

Re: N-Triples Unicode Ambiguity

From: Dave Beckett <dave.beckett@bristol.ac.uk>
Date: Fri, 5 Sep 2003 09:38:58 +0100
To: "Sean B. Palmer" <sean@mysterylights.com>
Cc: RDF Comments <www-rdf-comments@w3.org>
Message-Id: <20030905093858.6e27bb30.dave.beckett@bristol.ac.uk>

On Fri, 5 Sep 2003 02:36:13 +0100
"Sean B. Palmer" <sean@mysterylights.com> wrote:

> Section 3.2 of rdf-testcases [1] states:
> 
>    \UHHHHHHHH
>        8 required hexadecimal digits HHHHHHHH encoding
>        character [#x10000-#x10FFFF]
> 
> Which implies that any code point over U+10FFFF cannot be represented
> in an N-Triples string, unless encoded as a surrogate block. However,
> the test.nt N-Triples test file [2] referenced from rdf-testcases
> contains the following literal production instances:
> 
>    "\U001FFFFF" # resource18
>    "\U03FFFFFF" # resource19
>    "\U7FFFFFFF" # resource20
> 
> Each of which are greater than U+10FFFF. Is the rdf-testcases in
> error, or test.nt, or neither?

> [2] http://www.w3.org/2000/10/rdf-tests/rdfcore/ntriples/test.nt

The test.nt [2] is in error, pre-dating the previous correction
to the working draft to allow only code points up to #x10FFFF (from a
comment by I18N).  I will fix it shortly.

Thanks

Dave
Received on Friday, 5 September 2003 04:40:45 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 21 September 2012 14:16:32 GMT