At 19:44 16/11/96 -0800, Tim Bray wrote: >Also - a very material problem - with the current language, it is simply >impossible to base a full-text indexer on an XML parser; indexers often >need to know the byte offsets of words in entities. OK, there are other >problems: the processor needs to provide more data, e.g. lengths of excised >comments and entity references, but these can be added without breaking the >spec - the application of -xml-space="COLLAPSE" to any element fatally >cripples a full-text indexer. It is perfectly possible for a processor to collapse white space whilst reporting byte offsets. In fact SP does it today (in public identifiers). JamesReceived on Sunday, 17 November 1996 09:22:59 EST
This archive was generated by hypermail pre-2.1.9 : Wednesday, 24 September 2003 10:03:43 EDT