RE: [all] XLIFF round-trip samples from Yves Savourel on 2013-02-19 (public-multilingualweb-lt@w3.org from February 2013)

From: Yves Savourel <ysavourel@enlaso.com>
Date: Tue, 19 Feb 2013 08:12:46 -0700
To: "'Phil Ritchie'" <philr@vistatec.ie>, "'Dave Lewis'" <dave.lewis@cs.tcd.ie>
CC: "'Multilingual Web LT Public List'" <public-multilingualweb-lt@w3.org>
Message-ID: <001101ce0eb3$935a5a30$ba0f0e90$@com>
And here is a bit more well-formed and valid version :)

-ys

 

From: Phil Ritchie [mailto:philr@vistatec.ie] 
Sent: Tuesday, February 19, 2013 7:51 AM
To: Dave Lewis
Cc: Multilingual Web LT Public List
Subject: Re: [all] XLIFF round-trip samples

 

Dave 

An LQI sample attached. 



Phil.





From:        Dave Lewis <dave.lewis@cs.tcd.ie> 
To:        Multilingual Web LT Public List <public-multilingualweb-lt@w3.org>, 
Date:        18/02/2013 18:28 
Subject:        [all] XLIFF round-trip samples 

  _____  




Hi all,
Leroy has placed a series of ITS+XLIFF examples up at:
https://github.com/finnle/ITS-2.0-Testsuite/tree/master/its2.0/xliffsamples/roundtrip-example

We've handcrafted these to show incremental annotation of an xliff document by different activities, to form a linked set of transforms as detailed below.  We have a stub for an LQI step, but we've not populated this yet.

Comments on these would be very welcome from: 

* XLIFF-ITS mapping folk (David, Yves): though this not in most cases showing a mapping, it provides some best practice on generating ITs markup within an XLIFF workflow. Later we will implment the ability to generate a snapshot HTML for each stage - to the degree that makes sense - to validate ITS+XLIFF to ITS+HTML mapping. 
* David, Sean will this work with SOLAS OK? In some cases,  you may need to map this and populate the provenance data yourselves. Specifically, could you consume and produce this XLIFF in SOLAS for the MT mapper component if calling Bing (which you won't have to do for Matrex if Ankit can implement the XLIFF interface) 
* Tadej: do the XLIFF input and output for the text analytics step make sense? Would that be a format you'd be willing to support in Enrycher? 
* Marcis: similar question around terminology management (though for the demo we will build a simple front end to demo in Rome - so its a feasibility question, rather than any implementation request) 
* Ankit: we've already discussed using this XLIFF format with Matrex - so please confirm if this is doable 
* David, Sean: 
* anyone interested in mapping to W3C PROV model - we are starting to build a parallel best practice example based on these files at: http://www.w3.org/International/multilingualweb/lt/wiki/Provenance_Best_Practice#Example:_RDF-PROV_from_XLIFF.2FITS_Roundtrip

the activities map to files as inputs/outputs as follows: 

* extraction: 

* from https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-src.html 
* to https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf

* segmentation 

* from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf 
* to:  https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf

* text analytics; 

* from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf 
* to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf

* terminology management (building gloassary from output of text analytics using dbpedia queries for target term) 

* from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf 
* to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf

* two sets of MT 

* from; https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf 
* to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-matrex.xlf 
* and then to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf

* a PE activity: 

* from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf 
* to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf

* reassembly 

* from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf 
* to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-tgt.html

cheers,
Dave




************************************************************
This email and any files transmitted with it are confidential and
intended solely for the use of the individual or entity to whom they
are addressed. If you have received this email in error please notify
the sender immediately by e-mail.

www.vistatec.com
************************************************************
Attachments

application/octet-stream attachment: ex-xliff-lqa.xlf
Received on Tuesday, 19 February 2013 15:13:24 UTC