W3C home > Mailing lists > Public > public-multilingualweb-lt@w3.org > February 2013

[all] XLIFF round-trip samples

From: Dave Lewis <dave.lewis@cs.tcd.ie>
Date: Mon, 18 Feb 2013 18:27:29 +0000
Message-ID: <51227291.1080002@cs.tcd.ie>
To: Multilingual Web LT Public List <public-multilingualweb-lt@w3.org>
Hi all,
Leroy has placed a series of ITS+XLIFF examples up at:
https://github.com/finnle/ITS-2.0-Testsuite/tree/master/its2.0/xliffsamples/roundtrip-example

We've handcrafted these to show incremental annotation of an xliff 
document by different activities, to form a linked set of transforms as 
detailed below.  We have a stub for an LQI step, but we've not populated 
this yet.

Comments on these would be very welcome from:

  * XLIFF-ITS mapping folk (David, Yves): though this not in most cases
    showing a mapping, it provides some best practice on generating ITs
    markup within an XLIFF workflow. Later we will implment the ability
    to generate a snapshot HTML for each stage - to the degree that
    makes sense - to validate ITS+XLIFF to ITS+HTML mapping.
  * David, Sean will this work with SOLAS OK? In some cases,  you may
    need to map this and populate the provenance data yourselves.
    Specifically, could you consume and produce this XLIFF in SOLAS for
    the MT mapper component if calling Bing (which you won't have to do
    for Matrex if Ankit can implement the XLIFF interface)
  * Tadej: do the XLIFF input and output for the text analytics step
    make sense? Would that be a format you'd be willing to support in
    Enrycher?
  * Marcis: similar question around terminology management (though for
    the demo we will build a simple front end to demo in Rome - so its a
    feasibility question, rather than any implementation request)
  * Ankit: we've already discussed using this XLIFF format with Matrex -
    so please confirm if this is doable
  * David, Sean:
  * anyone interested in mapping to W3C PROV model - we are starting to
    build a parallel best practice example based on these files at:
    http://www.w3.org/International/multilingualweb/lt/wiki/Provenance_Best_Practice#Example:_RDF-PROV_from_XLIFF.2FITS_Roundtrip

the activities map to files as inputs/outputs as follows:

  * extraction:
      o from
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-src.html
      o to
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf
  * segmentation
      o from:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-extract.xlf
      o to:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf
  * text analytics;
      o from:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-seg.xlf
      o to:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf
  * terminology management (building gloassary from output of text
    analytics using dbpedia queries for target term)
      o from:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-tan.xlf
      o to:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf
  * two sets of MT
      o from;
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-term.xlf
      o to:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-matrex.xlf
      o and then to:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf
  * a PE activity:
      o from:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-MT-bing.xlf
      o to:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf
  * reassembly
      o from:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-post-PE.xlf
      o to:
        https://github.com/finnle/ITS-2.0-Testsuite/blob/master/its2.0/xliffsamples/roundtrip-example/EX-xliff-prov-rt-1-tgt.html

cheers,
Dave
Received on Monday, 18 February 2013 18:26:57 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 9 June 2013 00:25:08 UTC