- From: <bugzilla@wiggum.w3.org>
- Date: Tue, 14 Aug 2007 04:56:59 +0000
- To: public-qt-comments@w3.org
- CC:
http://www.w3.org/Bugs/Public/show_bug.cgi?id=4946 Summary: "all markup creates token boundaries" Product: XPath / XQuery / XSLT Version: Working drafts Platform: PC OS/Version: Linux Status: NEW Severity: normal Priority: P2 Component: Full Text AssignedTo: jim.melton@acm.org ReportedBy: jmdyck@ibiblio.org QAContact: public-qt-comments@w3.org As decided at meeting 147, a sentence has been added (in the editor's draft) to section 1.1: In the absence of an implementation-defined way to differentiate, all markup creates token boundaries. However, XML's definition of "markup" <http://www.w3.org/TR/2006/REC-xml-20060816/#syntax> is perhaps broader than what we had in mind when we said "all markup". For instance, it seems unlikely that we meant for a character reference to create a token boundary. Similarly for entity references and perhaps CDATA section delimiters. We could be more specific about which kinds of markup we mean, but instead, maybe we shouldn't be relying on the idea of markup. Full-Text operates on instances of the XQuery/XPath Data Model, where markup doesn't exist. So, for example, we might say: In the absence of an implementation-defined indication otherwise, a token must not contain characters from more than one node. (although we might have to make that more precise).
Received on Tuesday, 14 August 2007 04:57:02 UTC