- From: Phil Ritchie <philr@vistatec.ie>
- Date: Tue, 19 Feb 2013 14:51:10 +0000
- To: Dave Lewis <dave.lewis@cs.tcd.ie>
- Cc: Multilingual Web LT Public List <public-multilingualweb-lt@w3.org>
- Message-ID: <OF97A37B67.D45ED88D-ON80257B17.005180F2-80257B17.00519663@vistatec.ie>
Dave An LQI sample attached. Phil. From: Dave Lewis <dave.lewis@cs.tcd.ie> To: Multilingual Web LT Public List <public-multilingualweb-lt@w3.org>, Date: 18/02/2013 18:28 Subject: [all] XLIFF round-trip samples Hi all, Leroy has placed a series of ITS+XLIFF examples up at: https://github.com/finnle/ITS-2.0-Testsuite/tree/master/its2.0/xliffsamples/roundtrip-example We've handcrafted these to show incremental annotation of an xliff document by different activities, to form a linked set of transforms as detailed below. We have a stub for an LQI step, but we've not populated this yet. Comments on these would be very welcome from: XLIFF-ITS mapping folk (David, Yves): though this not in most cases showing a mapping, it provides some best practice on generating ITs markup within an XLIFF workflow. Later we will implment the ability to generate a snapshot HTML for each stage - to the degree that makes sense - to validate ITS+XLIFF to ITS+HTML mapping. David, Sean will this work with SOLAS OK? In some cases, you may need to map this and populate the provenance data yourselves. Specifically, could you consume and produce this XLIFF in SOLAS for the MT mapper component if calling Bing (which you won't have to do for Matrex if Ankit can implement the XLIFF interface) Tadej: do the XLIFF input and output for the text analytics step make sense? Would that be a format you'd be willing to support in Enrycher? Marcis: similar question around terminology management (though for the demo we will build a simple front end to demo in Rome - so its a feasibility question, rather than any implementation request) Ankit: we've already discussed using this XLIFF format with Matrex - so please confirm if this is doable David, Sean: anyone interested in mapping to W3C PROV model - we are starting to build a parallel best practice example based on these files at: http://www.w3.org/International/multilingualweb/lt/wiki/Provenance_Best_Practice#Example:_RDF-PROV_from_XLIFF.2FITS_Roundtrip the activities map to files as inputs/outputs as follows: extraction: from https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-src.html to https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf segmentation from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf text analytics; from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf terminology management (building gloassary from output of text analytics using dbpedia queries for target term) from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf two sets of MT from; https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-matrex.xlf and then to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf a PE activity: from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf reassembly from: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf to: https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-tgt.html cheers, Dave ************************************************************ This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this email in error please notify the sender immediately by e-mail. www.vistatec.com ************************************************************
Attachments
- application/octet-stream attachment: ex-xliff-lqa.xlf
Received on Tuesday, 19 February 2013 14:51:48 UTC