- From: Internationalization Core Working Group Issue Tracker <sysbot+tracker@w3.org>
- Date: Fri, 07 Sep 2012 16:01:12 +0000
- To: www-international@w3.org, public-rdf-comments@w3.org
I18N-ISSUE-191: Various nits in Appendix B [TURTLE]
http://www.w3.org/International/track/issues/191
Raised by: Addison Phillips
On product: TURTLE
Appendix B contains this note:
Encoding considerations:
The syntax of Turtle is expressed over code points in Unicode [UNICODE]. The encoding is always UTF-8 [UTF-8].
Unicode code points may also be expressed using an \uXXXX (U+0 to U+FFFF) or \UXXXXXXXX syntax (for U+10000 onwards) where X is a hexadecimal digit [0-9A-Fa-f]
As mentioned in other comments:
- The encoding refers to the serialization of a TURTLE document, not necessarily its in-memory representation (which should just be a sequence of Unicode code points)
- The reference to U+0 should read U+0000
- We recommend a different escape syntax altogether
- We recommend six-digit rather than eight-digit \U representation
Received on Friday, 7 September 2012 16:01:17 UTC