- From: Henry S. Thompson <ht@inf.ed.ac.uk>
- Date: Wed, 04 Apr 2007 17:31:09 +0100
- To: Norman Walsh <Norman.Walsh@Sun.COM>
- Cc: public-xml-processing-model-wg@w3.org
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Regrets, I'm out of the office tomorrow. Richard can vote the Univ. Edinburgh position -- if he's away, as I think he might be, proxy to the chair. MSM can vote the W3C position -- likewise, if _he_ is away, proxy to the chair. Wrt the issue at hand: Norman Walsh writes: > The caching issue has me quite concerned. I see three options: > > 1. We say nothing about it. Some implementations will cache, others > won't. Even on systems that don't cache, side effects of operation > will sometimes make caching appear to happen and sometimes not. Not acceptable -- too much negative impact on interop. > 2. We require caching. This may be a significant implementation issue. > It looks like a big step for V1. My preference, but see below -- I think we have to carefully bound the impact of the requirement. > 3. We forbid caching. This may be a significant implementation issue. > It may not even be possible for some implementations to prevent side > effects from "effective" caching. Not sure we couldn't find a way to make this work (along the lines of QT's "fetch once" requirement), but I'd rather put the work into speccing a limited-scope required caching. I guess this does involve forbidding it _outside_ that scope, hmmm. Anyway, what I have in mind starts from my previous post on the subject [1]: "Give 'p:group' an input, call it 'cache', and specify that any documents presented at that port must function as a local cache for any http GET issued by any step inside the group's subpipeline." To clarify a bit -- this would require implementations to: a) Not start evaluation of the p:group's subpipeline until all the inputs into the 'cache' port are complete; b) Taking all the documents presented to that port and indexing them via the [base URI] of their Document Information Item to create a local cache (two minor issues: what about docs with no [base URI]? What about duplicates?); c) Intercepting at least all http: and file: requests and supplying them from the cache if possible (issues: is this any use if it only covers explicit inputs, i.e. <p:document href="..."/> -- I think not. So it means getting into the platform retrieval stack(s) -- how hard _is_ this?); d) Specifying what happens _outside_ of p:group with a 'cache' port, e.g. do we go with a "first fetch" story along the lines of QT, or what? Sorry I can't be there to join in the fun, ht [1] http://lists.w3.org/Archives/Public/public-xml-processing-model-wg/2007Mar/0138.html - -- Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh Half-time member of W3C Team 2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440 Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk URL: http://www.ltg.ed.ac.uk/~ht/ [mail really from me _always_ has this .sig -- mail without it is forged spam] -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.6 (GNU/Linux) iD8DBQFGE9LNkjnJixAXWBoRAlgOAJ49Cl72w26Ivy1yHNNpoKGfj8QwNgCePuPz LGPjpiRjJavFW8FVV6yG5aY= =760H -----END PGP SIGNATURE-----
Received on Wednesday, 4 April 2007 16:31:24 UTC