- From: Gary Adams - Sun Microsystems Labs BOS <gra@zeppo.East.Sun.COM>
- Date: Fri, 18 Oct 1996 08:59:16 -0400
- To: keld@dkuug.dk, mduerst@ifi.unizh.ch, www-international@w3.org, Chris.Lilley@sophia.inria.fr
- Cc: rosenne@NetVision.net.il
After a couple of days on this thread I'm not sure I have a clear picture of the key issues. I think I heard : - ASCII is an unneccessary restriction and not sufficient for covering all the intended uses of CLASS names. Few people seriously consider this as a viable long term option. - compressing case (toupper or tolower) from CLASS names might provide some end user ease of use features, if a consistent algorithm can be used on all platforms to get identical results. Some people still consider this a worth while option, but many believe it to be a counter productive proposal. - in addition to case, many western languages could also benefit from composition/decomposition of accented characters when matching CLASS names. Many people believe that there are real benefits to a canonical representation that has learned from the mistakes of past proprietary or national encoding standards. - it's clear that the canonical representation of names cannot rely on the character input method or the native platform character encoding. Section 5.15 in the Unicode 2.0 standard provides a good look at the searching/sorting problems that users will be faced with. At the start of this discussion, I did not think users would actually be entering CLASS names manually, but that a selection system would be provided by applications to select/create templates. Ultimately a class is selected/defined because it represents a particular set of behaviours or it is an anchor point (e.g. a name) to which the behaviours can be attached. Going beyond the simple case of I18N/L10N monolingual environments a user may be faced with selecting classes and libraries of classes that have internally consistent names and that require additional mechanisms for use by non-native speakers to locate and identify them. e.g. aliases, translations, transliterations, etc. During the authoring process, I can think of many good reasons to be forgiving in the matching algorithms dealing with, case, accents, tones, etc. I'd even consider synonyms and typographically correctable errors in providing users with a lists of classes that meet a particular criteria, but in a runtime system which could be fetching a style sheet definition from a remote database of styles, I'd prefer to treat the CLASS name string as an exact database key for simple string matching purposes. e.g. the user of a class is presented the string that the author of the class originally typed or the database encoded as part of it's schema processing.
Received on Friday, 18 October 1996 09:00:44 UTC