W3C home > Mailing lists > Public > www-zig@w3.org > June 2001

RE: New Diagnostic Code Requested

From: LeVan,Ralph <levan@oclc.org>
Date: Thu, 7 Jun 2001 09:00:43 -0400
Message-ID: <E5431CF93E29F9478878F623E5B9CE9835091A@OA3-SERVER.oa.oclc.org>
To: www-zig@w3.org
> -----Original Message-----
> From: Robert Waldstein [mailto:wald@library.ho.lucent.com]
>
> QUery by example:
>     - So does a with umlaut translate to a
>     - diphthong (ae)         translate to ae (a followed by e)
>     - oneHalf (1 over 2)    translate to  1/2
>     - a superscript 2       translate to  a 2
>     - capital A            translate to "a" (guess we handle 
> this with an attribute, do we do the others?)

Bob, I think most of your examples fall into what I would call
normalization, not translation.  The character AE-ae is not the same as an a
followed by an e.  If the target characterset does not contain the AE-ae,
then a diagnostic to that effect should be sent back and the client/user can
decide whether to turn the diphthong into an "ae".

Now, we're heading into tricky ground here.  I do all sorts of
normalization.  I shift case, I pull apostrophes out of words, I throw away
punctuation.  I throw away diacritics.  Some of that behavior is going to
have to change as I support more non-English data.

Ralph
Received on Thursday, 7 June 2001 09:00:51 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 29 October 2009 06:12:21 GMT