RE: A question about the str( ) function on IRI

-------- Original Message --------
> From: Lu Jing <>
> Date: 14 July 2006 11:57
> 
> Hi,
> I believe this is a trivial question, but it did confuse me for a long
> time.  In the Section 11.4 "Operators Definitions", the description
for 
> str() says "returns the codepoint representation of rsrc (an IRI).".
I
> think the "codepoint" means the code of characters, in non-negative
> integers.  So I wonder why the "codepoint" is used here.

codepoint is a Unicode name: characters are abstract and get encoded in
codepoints.

http://www.unicode.org/glossary/

"""
Code Point. Any value in the Unicode codespace; that is, the range of
integers from 0 to 10FFFF16. (See definition D4b in Section 3.4,
Characters and Encoding.)
"""

or as says here:
http://www.w3.org/TR/xpath-functions/#string-types
"""
This document uses the term "code point", sometimes spelt "codepoint"
(also known as "character number" or "code position") to mean a
non-negative integer that represents a character in some encoding.
"""

>  Also, I think
> some of the readers (include me) may think if the result of
> str(<mailto:alice@work.example>) is "<mailto:alice@work.example>", but
> not "mailto:alice@work.example".

The terms delimited by "<>" and these do not form part of the IRI.  I've
added some text to explicitly state that in the section on IRI syntax.

>  Unfortunately, the following example
> uses a regex function to test the value, so that it can not be used to
> distinguish the above two possibilities. (Although I believe that the
> answer should be "mailto:alice@work.example")      

	Thanks,
	Andy

Received on Friday, 14 July 2006 13:39:43 UTC