- From: James K. Tauber <jtauber@jtauber.com>
- Date: Tue, 20 May 1997 08:44:45 +0800
- To: "'Gavin Nicol'" <gtn@eps.inso.com>
- Cc: "Peter@ursus.demon.co.uk" <Peter@ursus.demon.co.uk>, "w3c-sgml-wg@w3.org" <w3c-sgml-wg@w3.org>
Ah yes, the problematic definition of 'word'---brings back memories of first-year linguistics :-) Would *glyph* counting solve any of the problems? -- James K. Tauber / jtauber@jtauber.com Perth, Western Australia -----Original Message----- From: Gavin Nicol [SMTP:gtn@eps.inso.com] Sent: Tuesday, May 20, 1997 2:56 AM To: jtauber@jtauber.com Cc: Peter@ursus.demon.co.uk; w3c-sgml-wg@w3.org Subject: RE: Link-6: Addressing at the sub-element level >Could somebody outline the problems Unicode presents for character = >counting? Is it deciding how to count precomposed characters versus = >combinations or what? Yes. This and also the fact that certain combinations can cause what appears to the user as identical display, though the ordering might be different. You also run into problems with the definition of "word" if you wish to use that quantum.
Received on Monday, 19 May 1997 20:52:24 UTC