Re: Allowing \u escaped surrogate pairs

[I'm deliberately not putting this in the issue, because I want the issue to 
look clean.]

As far as I can tell, surrogates are not allowed at all in RDF 1.1 Turtle. 
The reason is that numeric escape sequences represent Unicode code points that 
are Unicode characters.  This appears to be only stated in Section 6.4.

So "\uD83C\uDCA1" is not valid in RDF 1.1 Turtle.

Again as far as I can tell, RDF 1.2 Turtle liberalizes RDF 1.1 Turtle because 
it allows any non-surrogate Unicode code point for numeric escape sequences, 
not just Unicode characters.

So "\uFFFE" is valid in RDF 1.2 Turtle, but not valid in RDF 1.1 Turtle.

Does anyone disagree with my conclusions?

peter




On 4/28/26 4:26 AM, Andy Seaborne wrote:
> As promised at the last telecon, I put together a position for responding to 
> the i18n wide review comment [1]
> 
> https://github.com/w3c/rdf-turtle/issues/138
> 
> Summary: support valid surrogate pairs written as \u escape sequences.
> 
>      Andy
> 
> [1] https://github.com/w3c/rdf-turtle/issues/131
>      https://github.com/w3c/rdf-trig/issues/60
> 
> 

Received on Tuesday, 28 April 2026 12:15:52 UTC