- From: Dave Lewis <dave.lewis@cs.tcd.ie>
- Date: Mon, 18 Feb 2013 18:27:29 +0000
- To: Multilingual Web LT Public List <public-multilingualweb-lt@w3.org>
- Message-ID: <51227291.1080002@cs.tcd.ie>
Hi all,
Leroy has placed a series of ITS+XLIFF examples up at:
https://github.com/finnle/ITS-2.0-Testsuite/tree/master/its2.0/xliffsamples/roundtrip-example
We've handcrafted these to show incremental annotation of an XLIFF
document by different activities, to form a linked set of transforms as
detailed below. We have a stub for an LQI step, but we've not populated
this yet.
Comments on these would be very welcome from:
* XLIFF-ITS mapping folk (David, Yves): though in most cases this does
not show a mapping, it provides some best practice on generating ITS
markup within an XLIFF workflow. Later we will implement the ability
to generate a snapshot HTML for each stage - to the degree that
makes sense - to validate the ITS+XLIFF to ITS+HTML mapping.
* David, Sean: will this work with SOLAS OK? In some cases, you may
need to map this and populate the provenance data yourselves.
Specifically, could you consume and produce this XLIFF in SOLAS for
the MT mapper component if calling Bing (which you won't have to do
for Matrex if Ankit can implement the XLIFF interface)?
* Tadej: do the XLIFF input and output for the text analytics step
make sense? Would that be a format you'd be willing to support in
Enrycher?
* Marcis: similar question around terminology management (though for
the demo we will build a simple front end to demo in Rome - so it's a
feasibility question, rather than any implementation request)
* Ankit: we've already discussed using this XLIFF format with Matrex -
so please confirm if this is doable
* David, Sean:
* anyone interested in mapping to W3C PROV model - we are starting to
build a parallel best practice example based on these files at:
http://www.w3.org/International/multilingualweb/lt/wiki/Provenance_Best_Practice#Example:_RDF-PROV_from_XLIFF.2FITS_Roundtrip
The activities map to files as inputs/outputs as follows:
* extraction:
o from
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-src.html
o to
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf
* segmentation
o from:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf
o to:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf
* text analytics:
o from:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf
o to:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf
* terminology management (building glossary from output of text
analytics using dbpedia queries for target term)
o from:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf
o to:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf
* two sets of MT
o from:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf
o to:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-matrex.xlf
o and then to:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf
* a PE activity:
o from:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf
o to:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf
* reassembly
o from:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf
o to:
https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-tgt.html
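For anyone wanting to script against the chain above, here is a rough Python sketch of the same activity-to-file mapping held as data, with a check that each step's output is the next step's input. The names STEPS, broken_links and file_chain are made up for illustration and are not part of the test suite. It also assumes the two MT passes run in sequence (Matrex first, then Bing), which is one reading of "and then to" above.

```python
# Hypothetical sketch: the roundtrip chain described above, held as data.
# Names here (BASE, STEPS, broken_links, file_chain) are illustrative only.

BASE = "EX-xliff-prov-rt-1-"

# (activity, input file suffix, output file suffix), in the order listed.
# Assumption: the two MT passes are sequential, Matrex first, then Bing.
STEPS = [
    ("extraction", "src.html", "post-extract.xlf"),
    ("segmentation", "post-extract.xlf", "post-seg.xlf"),
    ("text analytics", "post-seg.xlf", "post-tan.xlf"),
    ("terminology management", "post-tan.xlf", "post-term.xlf"),
    ("MT (Matrex)", "post-term.xlf", "post-MT-matrex.xlf"),
    ("MT (Bing)", "post-MT-matrex.xlf", "post-MT-bing.xlf"),
    ("PE", "post-MT-bing.xlf", "post-PE.xlf"),
    ("reassembly", "post-PE.xlf", "tgt.html"),
]


def broken_links(steps):
    """Return (activity, next_activity) pairs where the output of one
    step is not the input of the step that follows it."""
    problems = []
    for (name, _, out), (next_name, next_in, _) in zip(steps, steps[1:]):
        if out != next_in:
            problems.append((name, next_name))
    return problems


def file_chain(steps):
    """Return the full file names in the order they are produced,
    starting with the source document."""
    names = [BASE + steps[0][1]]
    names.extend(BASE + out for _, _, out in steps)
    return names


if __name__ == "__main__":
    for activity, src, dst in STEPS:
        print(f"{activity}: {BASE + src} -> {BASE + dst}")
    print("broken links:", broken_links(STEPS))
```

Keeping the mapping as a plain list makes it easy to add the LQI step later: insert one tuple and rerun the link check.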
cheers,
Dave
Received on Monday, 18 February 2013 18:26:57 UTC