Re: N-Triples Unicode Ambiguity

On Fri, 5 Sep 2003 02:36:13 +0100
"Sean B. Palmer" <sean@mysterylights.com> wrote:

> Section 3.2 of rdf-testcases [1] states:
> 
>    \UHHHHHHHH
>        8 required hexadecimal digits HHHHHHHH encoding
>        character [#x10000-#x10FFFF]
> 
> Which implies that any code point over U+10FFFF cannot be represented
> in an N-Triples string, unless encoded as a surrogate block. However,
> the test.nt N-Triples test file [2] referenced from rdf-testcases
> contains the following literal production instances:
> 
>    "\U001FFFFF" # resource18
>    "\U03FFFFFF" # resource19
>    "\U7FFFFFFF" # resource20
> 
> Each of which is greater than U+10FFFF. Is rdf-testcases in
> error, or test.nt, or neither?

> [2] http://www.w3.org/2000/10/rdf-tests/rdfcore/ntriples/test.nt

The test.nt file [2] is in error: it predates the earlier correction to
the working draft that allows only code points up to #x10FFFF (made
after a comment from I18N).  I will fix it shortly.
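
For anyone who wants to check their own N-Triples data for the same
problem, here is a minimal sketch in Python. It is not part of
rdf-testcases or of any parser; the names ESCAPE, MAX_CODE_POINT and
out_of_range_escapes are made up for illustration. It assumes that only
the upper bound #x10FFFF needs checking, and it accepts lowercase hex
digits as a lenient assumption.

```python
import re

# Upper bound of the range quoted above: [#x10000-#x10FFFF].
MAX_CODE_POINT = 0x10FFFF

# A backslash followed by either "U" plus exactly 8 hex digits (captured
# in group 1) or any other single character. Consuming the other escapes,
# including an escaped backslash, keeps "\\U7FFFFFFF" from being misread
# as a \U escape. Lowercase hex is accepted here as an assumption.
ESCAPE = re.compile(r'\\(?:U([0-9A-Fa-f]{8})|.)')


def out_of_range_escapes(text):
    """Return the \\U escapes in text whose value exceeds MAX_CODE_POINT."""
    bad = []
    for match in ESCAPE.finditer(text):
        digits = match.group(1)
        if digits is not None and int(digits, 16) > MAX_CODE_POINT:
            bad.append(match.group(0))
    return bad


if __name__ == "__main__":
    # The three literals from test.nt quoted above, plus one in-range value.
    samples = (
        r'"\U0010FFFF"',
        r'"\U001FFFFF"',
        r'"\U03FFFFFF"',
        r'"\U7FFFFFFF"',
    )
    for literal in samples:
        print(literal, out_of_range_escapes(literal))
```

The sketch only flags escapes; it does not parse whole triples, so it
will also report a matching escape that appears outside a literal.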

Thanks

Dave

Received on Friday, 5 September 2003 04:40:45 UTC