W3C home > Mailing lists > Public > public-rdf-dawg-comments@w3.org > July 2006

RE: A question about the str( ) function on IRI

From: Seaborne, Andy <andy.seaborne@hp.com>
Date: Fri, 14 Jul 2006 14:39:16 +0100
Message-ID: <DF5E364A470421429AE6DC96979A4F6FB6AB3F@sdcexc04.emea.cpqcorp.net>
To: "Lu Jing" <robert_lu00@hotmail.com>, <public-rdf-dawg-comments@w3.org>

-------- Original Message --------
> From: Lu Jing <>
> Date: 14 July 2006 11:57
> Hi,
> I believe this is a trivial question, but it did confuse me for a long
> time.  In the Section 11.4 "Operators Definitions", the description
> str() says "returns the codepoint representation of rsrc (an IRI).".
> think the "codepoint" means the code of characters, in non-negative
> integers.  So I wonder why the "codepoint" is used here.

codepoint is a Unicode name: characters are abstract and get encoded in


Code Point. Any value in the Unicode codespace; that is, the range of
integers from 0 to 10FFFF16. (See definition D4b in Section 3.4,
Characters and Encoding.)

or as says here:
This document uses the term "code point", sometimes spelt "codepoint"
(also known as "character number" or "code position") to mean a
non-negative integer that represents a character in some encoding.

>  Also, I think
> some of the readers (include me) may think if the result of
> str(<mailto:alice@work.example>) is "<mailto:alice@work.example>", but
> not "mailto:alice@work.example".

The terms delimited by "<>" and these do not form part of the IRI.  I've
added some text to explicitly state that in the section on IRI syntax.

>  Unfortunately, the following example
> uses a regex function to test the value, so that it can not be used to
> distinguish the above two possibilities. (Although I believe that the
> answer should be "mailto:alice@work.example")      

Received on Friday, 14 July 2006 13:39:43 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:52:07 UTC