W3C home > Mailing lists > Public > www-rdf-comments@w3.org > July to September 2003

Re: pfps-04 (why the thread is germane to pfps-04)

From: Martin Duerst <duerst@w3.org>
Date: Fri, 25 Jul 2003 15:28:38 -0400
Message-Id: <4.2.0.58.J.20030725151422.057ff8f8@localhost>
To: Brian McBride <bwm@hplb.hpl.hp.com>, "Peter F. " Patel-Schneider <pfps@research.bell-labs.com>
Cc: jjc@hplb.hpl.hp.com, Pat Hayes <phayes@ai.uwf.edu>, www-rdf-comments@w3.org, i18n <w3c-i18n-ig@w3.org>, msm@w3.org

Hello Brian,

At 14:00 03/07/25 +0100, Brian McBride wrote:

>I think we've established that UNICODE characters and octet sequences
>are disjoint.  Martin, chair of the I18N group confirmed this in:
>
>http://lists.w3.org/Archives/Public/www-rdf-comments/2003JulSep/0069.html
>
>Whilst Martin does not like the RDFCore design,

That's an altogether separate issue.


>as currently specified
>in the ed's drafts, XMLLiterals and plain literals are disjoint.

Sorry, but this is not a question of design. If the RDFCore design is
than XML Literals and plain literals are disjoint, then the RDF specs
should just say that they are different. The spec says what it says.

If you think you can avoid implementations making mistakes such as
suddenly coming up with equivalences, then I have to tell you that
just by saying that one of them is characters and the other is
octets based on UTF-8, that doesn't necessarily avoid problems.
It may actually happen that character strings are also stored
in UTF-8, and that by chance an implementer just does a simple
strcmp(). This is a different issue from the conceptual level
that I understand Peter is interested, where characters and
octets are definitely completely different things.
If there is anything else that you are worrying about,
I would appreciate to know.

Regards,    Martin.
Received on Friday, 25 July 2003 16:21:37 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 21 September 2012 14:16:32 GMT