W3C home > Mailing lists > Public > public-multilingualweb-lt@w3.org > February 2013

RE: Sample XLIFF files decorated with ITS 2.0 markup

From: Philip.O'Duffy <Philip.ODuffy@ul.ie>
Date: Thu, 7 Feb 2013 17:15:41 -0000
Message-ID: <69B6BE2FE67F244B8BAE81C0556C60B805FD4BF7@staffexchange4.ul.campus>
To: "Felix Sasaki" <fsasaki@w3.org>, <public-multilingualweb-lt@w3.org>
Cc: "Karl Fritsche" <karl.fritsche@cocomore.com>
Hey Guys,
   Yea, we noticed the lack of Xliff tags and extra header info eventually too. Here's the cleaned up version we made.
Thanks,
Philip


-----Original Message-----
From: Felix Sasaki [mailto:fsasaki@w3.org]
Sent: Thu 2/7/2013 4:56 PM
To: public-multilingualweb-lt@w3.org
Cc: Karl Fritsche
Subject: Re: Sample XLIFF files decorated with ITS 2.0 markup
 
Am 07.02.13 17:49, schrieb Yves Savourel:
> I think David (or more exactly Philip?) have been generating a big XLIFF file like this.
>
> But I suppose we could automate something and generate something like that more often (as our version of Okapi is likely up-to-date earlier than UL's)
>
> But one caveat: such output is rather meaningless as it will create XLIFF files with a single data category applies since the test files test one at a time. It might be better to have a couple of XML and HTML5 stuffed with all the data categories we can manage and generate the XLIFF from that. Actually such XML/HTML file could be useful as a more general test file for developer (not necessarily for conformance tests).
>
> I'll see if we can do that.
> Would that be ok to put it in some directory in the Test suite? or should that go somewhere else?

I would put such files into a separate directory under 
https://github.com/finnle/ITS-2.0-Testsuite/
and write a short documentation 1-2 paragraphs explaining that this is 
not part of the conformance testing.

I'm putting Karl into the loop since I think in the Cocomore / 
Linguaserve also a lot of HTML documents with multiple data categories 
have been created. Maybe a subset of these could also be input to the 
XLIFF example file set?

Best,

Felix
>
> -ys
>
> -----Original Message-----
> From: Felix Sasaki [mailto:fsasaki@w3.org]
> Sent: Thursday, February 07, 2013 9:14 AM
> To: public-multilingualweb-lt@w3.org
> Subject: Re: Sample XLIFF files decorated with ITS 2.0 markup
>
> Hi Arle, Marcis, all,
>
> AFAIF Okapi is creating XLIFF files in this use case http://www.w3.org/International/multilingualweb/lt/wiki/Use_cases_-_high_level_summary#Translation_Package_Creation
> from the list in that section, everything what you want Marcis would be used - but Fredrik and Yves can confirm. The only exception is language information, but here you might use xml:lang natively in XLIFF anyway?
> So maybe you can convince Yves and Fredrik to generate XLIFF with the test suite files in a batch job, and you have your sample? Just an idea.
>
> Best,
>
> Felix
>
> Am 07.02.13 17:04, schrieb Marcis Pinnis:
>> Hi All,
>>
>> We would also like to know whether there are XLIFF+ITS examples.
>>
>> We are interested in seeing Terminology, Language Information, Domain, Elements Within Text and Locale Filter examples (if there are any).
>>
>> Best regards,
>> Marcis ;o)
>>
>> -----Original Message-----
>> From: Arle Lommel [mailto:arle.lommel@dfki.de]
>> Sent: Thursday, February 07, 2013 3:29 PM
>> To: Multilingual Web LT Public List Public List
>> Cc: alan melby
>> Subject: Sample XLIFF files decorated with ITS 2.0 markup
>>
>> I just received a request from Alan Melby of the BYU Translation Research Group (TRG) for some sample (real) translation data in XLIFF format decorated with ITS 2.0 markup. Is there anything publicly available as sample data of this nature? The goal is to see some real data with at least a few ITS 2.0 categories in the data so Alan Melby's team can get a view of it in a practical setting. Bigger data sets would be better, but even small ones of a few dozens of segments would be useful.
>>
>> Best,
>>
>> -Arle
>>
>
>







Received on Thursday, 7 February 2013 17:16:07 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 9 June 2013 00:25:08 UTC