- From: Internationalization Core Working Group Issue Tracker <sysbot+tracker@w3.org>
- Date: Fri, 07 Sep 2012 15:46:40 +0000
- To: www-international@w3.org, public-rdf-comments@w3.org
I18N-ISSUE-187: escape syntax [TURTLE]
http://www.w3.org/International/track/issues/187
Raised by: Addison Phillips
On product: TURTLE
Section 6.4. The \u (lowercase u) syntax allows:
<q>
A Unicode codepoint in the range U+0 to U+FFFF inclusive corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit.
</q>
This is probably wrong, given that the surrogate code points fall into this range. No mention is made of surrogate pair handling.
It's not clear why the \U form should take eight hex digits when the first two are required to be 0.
Also, the trend seems to be going towards the variable-width form "\u{xxxxx}". See, for example:
http://unicode.org/reports/tr18/#Hex_notation
http://norbertlindenberg.com/2012/05/ecmascript-supplementary-characters/index.html#Escapes
Received on Friday, 7 September 2012 15:46:46 UTC