I18N-ISSUE-187: escape syntax [TURTLE]

http://www.w3.org/International/track/issues/187

Raised by: Addison Phillips
On product: TURTLE

Section 6.4. The \u (lowercase u) syntax allows:

<q>
A Unicode codepoint in the range U+0 to U+FFFF inclusive corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit.
</q>

This is probably wrong, given that the surrogate code points (U+D800 to U+DFFF) fall into this range. No mention is made of how surrogate pairs, or lone surrogates, should be handled.
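
To illustrate the gap, here is a minimal Python sketch of a decoder for the four-digit form only. The names decode_u4 and is_surrogate are hypothetical, and rejecting surrogates outright is my assumption about one reasonable policy; the draft itself does not say what should happen.

```python
import re

# Hypothetical helper, not taken from the Turtle draft: handles only \uXXXX.
_U4 = re.compile(r"\\u([0-9A-Fa-f]{4})")


def is_surrogate(cp: int) -> bool:
    """Return True for code points in the surrogate range U+D800..U+DFFF."""
    return 0xD800 <= cp <= 0xDFFF


def decode_u4(text: str) -> str:
    """Replace each four-digit escape with its character.

    Assumed policy: a surrogate code point is an error, because it is not
    a Unicode scalar value and cannot stand for a character on its own.
    """
    def repl(match: re.Match) -> str:
        cp = int(match.group(1), 16)
        if is_surrogate(cp):
            raise ValueError(f"surrogate code point U+{cp:04X} in escape")
        return chr(cp)

    return _U4.sub(repl, text)
```

Under this policy an escaped surrogate pair such as \uD83D\uDE00 is rejected rather than combined into U+1F600. A spec could just as well require combining pairs; the point is that it needs to say which.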

It's not clear why the \U form should take eight hex digits when the first two are required to be 0: the highest Unicode code point is U+10FFFF, which needs only six.

Also, the trend seems to be toward the variable-width form "\u{xxxxx}". See, for example:

http://unicode.org/reports/tr18/#Hex_notation
http://norbertlindenberg.com/2012/05/ecmascript-supplementary-characters/index.html#Escapes 
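
For comparison, a rough Python sketch of how the variable-width form could be decoded. The name decode_braced is hypothetical, and the one-to-six digit limit and the rejection of surrogates are my assumptions, not something either reference above mandates for Turtle.

```python
import re

# Hypothetical helper: matches \u{...} with one to six hex digits (assumed limit).
_BRACED = re.compile(r"\\u\{([0-9A-Fa-f]{1,6})\}")


def decode_braced(text: str) -> str:
    """Replace each braced escape with the character it names."""
    def repl(match: re.Match) -> str:
        cp = int(match.group(1), 16)
        # Six hex digits can exceed the Unicode range, so check the upper bound.
        if cp > 0x10FFFF:
            raise ValueError(f"code point {cp:#x} is above U+10FFFF")
        # Assumed policy: surrogates are not valid on their own.
        if 0xD800 <= cp <= 0xDFFF:
            raise ValueError(f"surrogate code point U+{cp:04X} in escape")
        return chr(cp)

    return _BRACED.sub(repl, text)
```

With this form a supplementary character is written once, as a single code point, so no padding digits and no surrogate pairs are needed.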

Received on Friday, 7 September 2012 15:46:46 UTC