[Prev][Next][Index][Thread]

Re: Feedback on the spec



At 19:44 16/11/96 -0800, Tim Bray wrote:

>Also - a very material problem - with the current language, it is simply
>impossible to base a full-text indexer on an XML parser; indexers often
>need to know the byte offsets of words in entities.  OK, there are other
>problems: the processor needs to provide more data, e.g. lengths of excised 
>comments and entity references, but these can be added without breaking the 
>spec - the application of -xml-space="COLLAPSE" to any element fatally 
>cripples a full-text indexer.

It is perfectly possible for a processor to collapse white space whilst
reporting byte offsets.  In fact SP does it today (in public identifiers).

James