W3C home > Mailing lists > Public > www-i18n-comments@w3.org > July 2002

Re: Heuristics

From: Martin Duerst <duerst@w3.org>
Date: Fri, 12 Jul 2002 11:40:31 +0900
Message-Id: <4.2.0.58.J.20020712113346.057c4910@localhost>
To: cmsmcq@acm.org (C. M. Sperberg-McQueen), www-i18n-comments@w3.org
Cc: w3c-i18n-ig@w3.org

Hello Michael,

Many thanks for your comment. We have received similar comments,
and have already decided to make some changes, although we don't
yet know the exact wording.

We think that the Appendix of XML 1.0 does not give heuristics,
in that no two implementations will interpret a document as
being encoded in different encodings. (there may of course
be differences in what encodings the two implementations accept).

By heuristics, we mainly meant statistically or otherwise
judgement-based or guessing-like procedures that won't be
implemented consistently in different implementations.
Do you have any idea of how to express this in a better way?

Regards,    Martin.

At 11:18 02/07/12 +0900, C. M. Sperberg-McQueen wrote:

>This is a last call comment from C. M. Sperberg-McQueen (cmsmcq@acm.org) on
>the Character Model for the World Wide Web 1.0
>(http://www.w3.org/TR/2002/WD-charmod-20020430/).
>
>Semi-structured version of the comment:
>
>Submitted by: C. M. Sperberg-McQueen (cmsmcq@acm.org)
>Submitted on behalf of (maybe empty):
>Comment type: substantive
>Chapter/section the comment applies to: 3.6 Choice and Identification of 
>Character Encodings
>The comment will be visible to: public
>Comment title: Heuristics
>Comment:
>The spec says "[S] Specifications MUST NOT propose the use of heuristics
>to determine the encoding of data."  Is it your intent to outlaw the
>heuristics given by the XML 1.0 spec?  If not, I believe the wording
>of this requirement is incorrect, and should be changed to something
>specifying that heuristics should be used only in the absence of
>usable labels.  If it is your intent to disallow the heuristics defined
>in XML 1.0, I disagree, and believe the MUST NOT should be changed to
>something weaker, preferably a generic health warning.
Received on Friday, 12 July 2002 23:51:57 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 October 2009 08:32:32 GMT