
Re: Feedback on the spec

From: James Clark <jjc@jclark.com>
Date: Sun, 17 Nov 1996 13:22:11 +0000
Message-Id: <2.2.32.19961117132211.009fec94@pop.jclark.com>
To: Tim Bray <tbray@textuality.com>
Cc: w3c-sgml-wg@w3.org

At 19:44 16/11/96 -0800, Tim Bray wrote:

>Also - a very material problem - with the current language, it is simply
>impossible to base a full-text indexer on an XML parser; indexers often
>need to know the byte offsets of words in entities.  OK, there are other
>problems: the processor needs to provide more data, e.g. lengths of excised 
>comments and entity references, but these can be added without breaking the 
>spec - the application of -xml-space="COLLAPSE" to any element fatally 
>cripples a full-text indexer.

It is perfectly possible for a processor to collapse white space whilst
reporting byte offsets.  In fact SP does it today (in public identifiers).

James
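
To illustrate the point about collapsing white space while still reporting
byte offsets, here is a minimal sketch in Python. It is not how SP does it;
the function name collapse_with_offsets and the choice of space, tab, CR and
LF as the white space characters are assumptions made for the example. The
idea is only that the processor records where each word started in the
original entity before it discards the surrounding white space.

```python
import re

# Assumption for this sketch: "white space" means space, tab, CR and LF.
WORD = re.compile(rb"[^ \t\r\n]+")

def collapse_with_offsets(data: bytes):
    """Collapse runs of white space to single spaces and drop leading and
    trailing white space, while remembering the byte offset at which each
    word began in the original, uncollapsed input."""
    words = []
    offsets = []
    for match in WORD.finditer(data):
        words.append(match.group())
        offsets.append(match.start())  # offset into the original bytes
    return b" ".join(words), offsets

if __name__ == "__main__":
    entity = b"  full-text \n\t indexer  needs\r\noffsets "
    collapsed, offsets = collapse_with_offsets(entity)
    print(collapsed)
    for word, offset in zip(collapsed.split(b" "), offsets):
        print(offset, word)
```

An indexer handed both return values sees the collapsed text but can still
map every word back to its position in the entity as stored.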
Received on Sunday, 17 November 1996 09:22:59 EST
