
I18N-ISSUE-191: Various nits in Appendix B [TURTLE]

From: Internationalization Core Working Group Issue Tracker <sysbot+tracker@w3.org>
Date: Fri, 07 Sep 2012 16:01:12 +0000
Message-Id: <E1TA0zY-0003XQ-KX@nelson.w3.org>
To: www-international@w3.org, public-rdf-comments@w3.org

http://www.w3.org/International/track/issues/191

Raised by: Addison Phillips
On product: TURTLE

Appendix B contains this note:

Encoding considerations:
    The syntax of Turtle is expressed over code points in Unicode [UNICODE]. The encoding is always UTF-8 [UTF-8].
    Unicode code points may also be expressed using an \uXXXX (U+0 to U+FFFF) or \UXXXXXXXX syntax (for U+10000 onwards) where X is a hexadecimal digit [0-9A-Fa-f]

As mentioned in other comments:

- The encoding refers to the serialization of a TURTLE document, not necessarily its in-memory representation (which should just be a sequence of Unicode code points)
- The reference to U+0 should read U+0000
- We recommend a different escape syntax altogether
- We recommend six-digit rather than eight-digit \U representation
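
For illustration only, the following Python sketch shows the distinction drawn in the first point: the escapes quoted from Appendix B resolve to Unicode code points, and UTF-8 only applies when those code points are serialized. The function name decode_numeric_escapes is hypothetical, and the sketch handles only the two numeric escape forms quoted above. It ignores every other Turtle escape (so an escaped backslash followed by "u0041" would be misread) and does not reject surrogate code points.

```python
import re

# Matches only the two numeric escape forms quoted from Appendix B:
# \uXXXX (four hex digits) and \UXXXXXXXX (eight hex digits).
_NUMERIC_ESCAPE = re.compile(r"\\u([0-9A-Fa-f]{4})|\\U([0-9A-Fa-f]{8})")

MAX_CODE_POINT = 0x10FFFF  # largest Unicode code point


def decode_numeric_escapes(text: str) -> str:
    """Replace \\uXXXX and \\UXXXXXXXX escapes with the code points they name.

    Hypothetical helper for illustration; not a conforming Turtle parser.
    """
    def replace(match: re.Match) -> str:
        hex_digits = match.group(1) or match.group(2)
        code_point = int(hex_digits, 16)
        if code_point > MAX_CODE_POINT:
            raise ValueError(f"U+{code_point:X} is outside the Unicode range")
        return chr(code_point)

    return _NUMERIC_ESCAPE.sub(replace, text)


# In memory: a sequence of code points, independent of any encoding.
decoded = decode_numeric_escapes(r"\u00E9 \U0001F600")
assert len(decoded) == 3  # U+00E9, U+0020, U+1F600

# On the wire: UTF-8 is applied only when the document is serialized.
serialized = decoded.encode("utf-8")
assert len(serialized) == 7  # 2 bytes + 1 byte + 4 bytes

# The largest code point needs only six hexadecimal digits.
assert format(MAX_CODE_POINT, "X") == "10FFFF"
```

The last assertion is consistent with the six-digit recommendation: in any valid eight-digit \U escape the two leading digits are always 00.
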
Received on Friday, 7 September 2012 16:01:18 GMT
