- From: Norman Walsh <ndw@nwalsh.com>
- Date: Wed, 12 Dec 2007 15:26:46 -0500
- To: public-xml-processing-model-wg@w3.org
- Message-ID: <m2r6hrzmk9.fsf@nwalsh.com>
Interesting and useful, but not in V1.
/ ht@inf.ed.ac.uk (Henry S. Thompson) was heard to say:
| I gave a presentation on the Last Call WD to some of the XML people at
| MSFT on Friday, and got a pleasantly positive reception.
|
| One specific question they were interested in, looking towards very
| large scale data processing with parallel hardware available, was
| whether we supported Google 'map-reduce' style decomposition. I
| mentioned the inherent parallelisability of the overall architecture,
| but realised we did not have anything which would directly support
| such decomposition. Maybe we should consider it. . .
|
| We already have 'map' -- it's just for-each with a select pattern on
| its input.
|
| Here's an example of how it could be used along with a new 'reduce'
| construct:
|
| Stipulate we have a pipeline which can construct an index for a book
| chapter. Here's how we index the whole book:
|
| <for-each select='//chapter'>
| [compute index]
| </for-each>
|
| <reduce name='r'>
| <input port="seed">
| <inline>
| </bookIndex>
| </inline>
| </input>
|
| <merge-two-indices>
| <input port='book'>
| <pipe port='seed' step='r'/>
| </input>
| </merge-two-indices>
|
| </reduce>
|
| where merge-two-indices has two inputs, primary a chapter index and
| secondary a book index, and one output, a new book index merging in
| the chapter index.
|
| reduce takes a primary sequence input and a secondary single input
| (the seed) and a subpipeline. It runs the subpipeline repeatedly,
| supplying each member of the sequence in turn as the default input and
| first time the seed, and subsequent times the output of the previous
| round, as the 'seed' input. Output is the output of the subpipeline
| From the last iteration.
|
| Such a construct would give us a way of addressing our current lack of
| open-ended/runtime input/output cardinality.
|
| ht
| --
| Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
| Half-time member of W3C Team
| 2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
| Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk
| URL: http://www.ltg.ed.ac.uk/~ht/
| [mail really from me _always_ has this .sig -- mail without it is forged spam]
Be seeing you,
norm
--
Norman Walsh <ndw@nwalsh.com> | Do not condemn the judgement of another
http://nwalsh.com/ | because it differs from your own. You
| may both be wrong.-- Dandemis
Received on Wednesday, 12 December 2007 20:27:02 UTC