- From: Richard Cyganiak <richard@cyganiak.de>
- Date: Fri, 7 Sep 2012 18:17:39 +0100
- To: Gavin Carothers <gavin@carothers.name>, www-international@w3.org
- Cc: Internationalization Core Working Group Issue Tracker <sysbot+tracker@w3.org>, public-rdf-comments Comments <public-rdf-comments@w3.org>
On 7 Sep 2012, at 17:37, Gavin Carothers wrote:
>>> It's not clear why the \U form should take eight hex digits when the
>>> first two are required to be 0.
>>
>> Because C++ did it and everybody follows them. It's better if all languages
>> have the same representation of strings, even if it's not a very good one.
>
> Turtle's is inherited from Python, but I believe Python's is from C++
\uXXXX and \UXXXXXXXX are also in ISO C AFAIK.
I like the \u{X} form (where X may be 1-6 hex digits) that seems to be under consideration for ECMAScript. I believe Ruby does this too.
But I feel that Turtle should not add anything new here unless it gets into SPARQL too.
I feel that the \uxxxx and \UXXXXXXXX forms cannot be removed at this point due to existing implementations and deployed data. Both forms have been in N-Triples since 2004. N-Triples is defined in a W3C Recommendation [1], and Turtle is designed as a superset of N-Triples.
Best,
Richard
[1] http://www.w3.org/TR/rdf-testcases/#ntrip_strings
Received on Friday, 7 September 2012 17:18:09 UTC