W3C home > Mailing lists > Public > www-tag@w3.org > October 2004

Re: XML Chunk Equality

From: Norman Walsh <Norman.Walsh@Sun.COM>
Date: Tue, 26 Oct 2004 15:54:17 -0400
To: Elliotte Rusty Harold <elharo@metalab.unc.edu>
Cc: www-tag@w3.org
Message-id: <87pt35i7bq.fsf@nwalsh.com>
/ Elliotte Rusty Harold <elharo@metalab.unc.edu> was heard to say:
| The finding says:
| Natural Language. The xml:lang attribute is not treated specially in
| the Infoset but is intended to have a scoped effect much like the base
| URI. This intention is made explicit in this specification.
| I'm assume you're aware that the XInclude group is in the process of
| creating a new language property for the element information item. How
| does this affect the finding?

I haven't thought carefully about that yet.

| "Languages are compared case insensitively."
| Using what language? Case mappings are not always the same from
| language to language. I is not the uppercase form of i in Turkish, for
| example. xml:lang values are normally ASCII but don't have to be.
| What's probably intended here is that languages are compared case
| insensitively within the ASCII range using English case mappings.

Uhm, yes that should be made clearer. The content of xml:lang is
supposed to be IETF RFC 3066 language tags, which are only US ASCII, I

| 2.6 Unexpanded Entity Reference Information Items
| Should system identifiers that differ only in case of the protocol and
| or the the authority be considered unequal? What about system
| identifiers that use percent escapes?
| Same question for Document Type Declaration Information Items and
| UNparsed entity information items.

I think the easy answer is, performe Unicode code-point comparison.

                                        Be seeing you,

Norman.Walsh@Sun.COM / XML Standards Architect / Sun Microsystems, Inc.
NOTICE: This email message is for the sole use of the intended
recipient(s) and may contain confidential and privileged information.
Any unauthorized review, use, disclosure or distribution is prohibited.
If you are not the intended recipient, please contact the sender by
reply email and destroy all copies of the original message.

Received on Tuesday, 26 October 2004 19:54:24 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 22:56:06 UTC