- From: James Clark <jjc@jclark.com>
- Date: Sun, 17 Nov 1996 13:22:11 +0000
- To: Tim Bray <tbray@textuality.com>
- Cc: w3c-sgml-wg@w3.org
At 19:44 16/11/96 -0800, Tim Bray wrote:
>Also - a very material problem - with the current language, it is simply
>impossible to base a full-text indexer on an XML parser; indexers often
>need to know the byte offsets of words in entities. OK, there are other
>problems: the processor needs to provide more data, e.g. lengths of excised
>comments and entity references, but these can be added without breaking the
>spec - the application of -xml-space="COLLAPSE" to any element fatally
>cripples a full-text indexer.

It is perfectly possible for a processor to collapse white space whilst
reporting byte offsets. In fact SP does it today (in public identifiers).

James
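To illustrate the claim that collapsing white space and reporting byte offsets are compatible, here is a minimal sketch in Python. It is not how SP does it; the function name `collapse_with_offsets` and the choice of space, tab, CR and LF as white space are assumptions made for the example. The idea is simply that the processor records where each word started in the original entity before it discards the extra white space.

```python
import re

# Assumption for this sketch: white space is space, tab, CR or LF.
WORD = re.compile(rb"[^ \t\r\n]+")

def collapse_with_offsets(data: bytes):
    """Collapse white space in `data` and keep original byte offsets.

    Returns (collapsed, words), where `collapsed` has every run of
    white space reduced to one space (leading and trailing runs are
    dropped) and `words` is a list of (byte_offset, word) pairs whose
    offsets refer to positions in the ORIGINAL, uncollapsed data.
    """
    # Record each word together with its start position in the input.
    words = [(m.start(), m.group()) for m in WORD.finditer(data)]
    # Build the collapsed text from the words alone.
    collapsed = b" ".join(word for _, word in words)
    return collapsed, words

if __name__ == "__main__":
    entity = b"  full-text \n\t indexer  needs\r\noffsets "
    collapsed, words = collapse_with_offsets(entity)
    print(collapsed)
    for offset, word in words:
        # The offset still indexes into the original entity.
        assert entity[offset:offset + len(word)] == word
        print(offset, word)
```

An indexer that receives the `(byte_offset, word)` pairs can ignore the collapsed text entirely for positioning, so the collapsing does not cost it any information about where words sit in the entity.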
Received on Sunday, 17 November 1996 09:22:59 UTC