W3C home > Mailing lists > Public > w3c-sgml-wg@w3.org > November 1996

Re: Feedback on the spec

From: James Clark <jjc@jclark.com>
Date: Sun, 17 Nov 1996 13:22:11 +0000
Message-Id: <>
To: Tim Bray <tbray@textuality.com>
Cc: w3c-sgml-wg@w3.org
At 19:44 16/11/96 -0800, Tim Bray wrote:

>Also - a very material problem - with the current language, it is simply
>impossible to base a full-text indexer on an XML parser; indexers often
>need to know the byte offsets of words in entities.  OK, there are other
>problems: the processor needs to provide more data, e.g. lengths of excised 
>comments and entity references, but these can be added without breaking the 
>spec - the application of -xml-space="COLLAPSE" to any element fatally 
>cripples a full-text indexer.

It is perfectly possible for a processor to collapse white space whilst
reporting byte offsets.  In fact SP does it today (in public identifiers).

Received on Sunday, 17 November 1996 09:22:59 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:25:20 UTC