W3C home > Mailing lists > Public > public-multilingualweb-lt@w3.org > January 2013

Re: Locworld June Re: Update [Agenda] MLW-LT call 7 January, 4 p.m. UTC

From: Tadej Štajner <tadej.stajner@ijs.si>
Date: Wed, 09 Jan 2013 12:22:02 +0100
Message-ID: <50ED52DA.3000406@ijs.si>
To: Dave Lewis <dave.lewis@cs.tcd.ie>
CC: "Dr. David Filip" <David.Filip@ul.ie>, Yves Savourel <ysavourel@enlaso.com>, Felix Sasaki <fsasaki@w3.org>, public-multilingualweb-lt@w3.org
Hi, all,
while I can't commit to coming to LocWorld, I can contribute some 
content to a submission on the disambiguation part, and make a demo 
-- Tadej

On 09. 01. 2013 02:28, Dave Lewis wrote:
> David,
> You are right, this sort of opportunistic MT use of disambiguation 
> data would suit more the home spun/push button trained MT market, I'd 
> presume the big guys like Google would have sucked DBpedia dry of all 
> useful parallel text already.
> The other two use cases you describe are also very credible options, 
> and probably more immediately understandable to localization folk so 
> very worth pursuing.
> cheers,
> Dave
> On 08/01/2013 22:25, Dr. David Filip wrote:
>> Dave, says that the disambiguation is more useful for MT. I do not 
>> entirely agree.
>> I do not see large MT providers like Google building support for this 
>> info. On the other hand, SOLAS consumes the disambiguation info and 
>> presents it to the human agent, as a color-coded span with linked 
>> reference, eventually prose comment.
>> People keep talking about importance of terminology management, 
>> however terminology has always lower priority compared to direct 
>> leveraging, as it is harder to put a financial benefit tag on it. 
>> Getting marked terminology candidates via ITS encoded disambiguation 
>> info is very attractive bootstrapping scenario for human translation..
>> Apart from that using disambiguation as term candidates in a broder 
>> term life cycle is an obvious backdoor for terminology (term), which 
>> is good.
Received on Wednesday, 9 January 2013 11:23:47 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:32:00 UTC