Re: pfps-04 (why the thread is germane to pfps-04) from Martin Duerst on 2003-07-25 (www-rdf-comments@w3.org from July to September 2003)

From: Martin Duerst <duerst@w3.org>
Date: Fri, 25 Jul 2003 15:28:38 -0400
To: Brian McBride <bwm@hplb.hpl.hp.com>, "Peter F. " Patel-Schneider <pfps@research.bell-labs.com>
Cc: jjc@hplb.hpl.hp.com, Pat Hayes <phayes@ai.uwf.edu>, www-rdf-comments@w3.org, i18n <w3c-i18n-ig@w3.org>, msm@w3.org
Message-Id: <4.2.0.58.J.20030725151422.057ff8f8@localhost>

Hello Brian,

At 14:00 03/07/25 +0100, Brian McBride wrote:

>I think we've established that UNICODE characters and octet sequences
>are disjoint.  Martin, chair of the I18N group confirmed this in:
>
>http://lists.w3.org/Archives/Public/www-rdf-comments/2003JulSep/0069.html
>
>Whilst Martin does not like the RDFCore design,

That's an altogether separate issue.

>as currently specified
>in the ed's drafts, XMLLiterals and plain literals are disjoint.

Sorry, but this is not a question of design. If the RDFCore design is
than XML Literals and plain literals are disjoint, then the RDF specs
should just say that they are different. The spec says what it says.

If you think you can avoid implementations making mistakes such as
suddenly coming up with equivalences, then I have to tell you that
just by saying that one of them is characters and the other is
octets based on UTF-8, that doesn't necessarily avoid problems.
It may actually happen that character strings are also stored
in UTF-8, and that by chance an implementer just does a simple
strcmp(). This is a different issue from the conceptual level
that I understand Peter is interested, where characters and
octets are definitely completely different things.
If there is anything else that you are worrying about,
I would appreciate to know.

Regards,    Martin.

Received on Friday, 25 July 2003 16:21:37 UTC