are SPARQL queries unicode?

Björn commented [CMNT] that productions like:
  NCCHAR ::= NCCHAR1 | '-' | [0-9] | #x00B7 | [#x0300-#x036F] | [#x203F-#x2040]
and even
  WS ::= #x20 | #x9 | #xD | #xA
need to specify a codepoint convention for those numbers to mean
anything.

We've since visited this text, but in the interest of clarity, I am
considering changing our current text from:
[[
A SPARQL query string is a Unicode character string (c.f. section 6.1
String concepts of [CHARMOD]) in the language defined by the following
grammar, starting with the Query production.  The EBNF format is the
same as that used in the XML 1.1 specification[XML11]. Please see the
"Notation" section of that specification for specific information about
the notation.
]]

to:
[[
A SPARQL query is a string (c.f. section 6.1 String concepts of
[CHARMOD]) in the language defined by the following grammar, starting
with the Query production.  The EBNF format is the same as that used in
the XML 1.1 specification[XML11]. Numeric references,
e.g. <code>#x27</code> or <code>#xxD7FF</code>, identify charactars by
unicode codepoint. Please see the "Notation" section of that
specification for specific information about the notation.
]]

This says that the grammar is read as unicode codepoints (editorial)
and says that SPARQL Queries are independent of encoding (substantive).

[CMNT] http://www.w3.org/mid/43046b29.399234875@smtp.bjoern.hoehrmann.de
-- 
-eric

office: +81.466.49.1170 W3C, Keio Research Institute at SFC,
                        Shonan Fujisawa Campus, Keio University,
                        5322 Endo, Fujisawa, Kanagawa 252-8520
                        JAPAN
        +1.617.258.5741 NE43-344, MIT, Cambridge, MA 02144 USA
cell:   +81.90.6533.3882

(eric@w3.org)
Feel free to forward this message to any list for any purpose other than
email address distribution.

Received on Tuesday, 25 October 2005 17:32:16 UTC