W3C home > Mailing lists > Public > public-multilingualweb-lt@w3.org > February 2013

agenda+ for Wednesday (WG Draft about high level use cases (Re: [ISSUE-75] - Domain - 2.a. [ACTION-434]))

From: Felix Sasaki <fsasaki@w3.org>
Date: Tue, 05 Feb 2013 16:52:09 +0100
Message-ID: <51112AA9.5080702@w3.org>
To: "public-multilingualweb-lt@w3.org" <public-multilingualweb-lt@w3.org>
Hi all,

this is to resolve Pablo's comment on the wiki link.

Dave, I can't be on the Wednesday call, but could you discuss this topic:

"publising
http://www.w3.org/International/multilingualweb/lt/wiki/Use_cases_-_high_level_summary
as a w3c working draft (not a final document) before Rome"

With the aim to finally publish it as a "note" (= an informative best 
practice document). Needed to for that would be *by 18 February*:

- a "clean up" of sections, so that they don't contain coments any more 
and follow the style of
http://www.w3.org/International/multilingualweb/lt/wiki/Use_cases_-_high_level_summary#Simple_Machine_Translation
- useful links to tool demos or further info if available)
- sometimes re-naming and editing, e.g.
http://www.w3.org/International/multilingualweb/lt/wiki/Use_cases_-_high_level_summary#Annotation_of_Named_Entities
to
"text analysis annotation"? see issue-68

Per partner this would actually not be a lot of work: I assume 1-2 
hours. Dave, could you discuss whether people would be available to do 
that by 18 February? I would then take care of a short introduction. 
Conversion to a working draft format will happen on the fly, see

http://services.w3.org/xslt?xmlfile=http%3A%2F%2Fwww.w3.org%2FInternational%2Fmultilingualweb%2Flt%2Fwiki%2FUse_cases_-_high_level_summary&specinfo=http%3A%2F%2Fwww.w3.org%2F2012%2F05%2Fwiki2spec%2Fwiki2spec-parameters.html&xslfile=http%3A%2F%2Fwww.w3.org%2F2012%2F05%2Fwiki2spec%2Fwiki2spec.xsl

proposal of the location of the draft and name:

http://www.w3.org/TR/mlw-metadata-uc-impl
"Metadata for the Multilingual Web - Use Cases and Implementations"


Thanks,

Felix

Am 05.02.13 16:17, schrieb Pablo Nieto Caride:
> Hi,
>
> It also works for me, although as I think I mentioned before I don't know
> whether to use links to the wiki is appropriate or not.
>
> Cheers,
> Pablo.
> That works for me
>
> Thanks,
>
> Jan
>
> ________________________________________
> From: Lieske, Christian [christian.lieske@sap.com]
> Sent: Tuesday, February 05, 2013 5:35 AM
> To: public-multilingualweb-lt-comments@w3.org
> Cc: joerg@bioloom.de
> Subject: RE: [ISSUE-75] - Domain - 2.a. [ACTION-434]
>
> Hi,
>
> I had an action item to re-write the note related to "domainMapping" in
> "multi-engine" scenarios. Here is comes ...
>
> Cheers,
> Christian
> ==
> Although the focus of ITS 2.0, and some of the usage scenarios addressed in
> ITS 2.0 showcases (see
> http://www.w3.org/International/multilingualweb/lt/wiki/Use_cases_-_high_lev
> el_summary#ITS_2.0_Metadata:_Work-In-Context_Showcase) is on "single engine"
> environments, ITS 2.0 - for example in the context of the "domain" data
> category - can accommodate "workflow/multi engine" scenarios.
>
> Example:
>
> - A scenario involves Machine Translation (MT) engines A and B. The domain
> labels used by engine A follow the naming scheme A_123, the one for engine B
> follow the naming scheme B_456.
> - A "domainMapping" like the following is in place: domainMapping="'sports
> law' Legal, 'property law' Legal"
> - Engine A maps 'Legal' to A_4711, Engine B maps 'Legal' to B_42.
>
> Thus, ITS does not encode a process or workflow (like "Use MT engine A with
> domain A_4711, and use MT engine B with domain A_42"). Rather, it encodes
> information that can be used in workflows.
> -----Original Message-----
> From: Jörg Schütz [mailto:joerg@bioloom.de]
> Sent: Mittwoch, 30. Januar 2013 09:37
> To: public-multilingualweb-lt-comments@w3.org
> Subject: Re: [ISSUE-75] - Domain - 2.a. incl. 2.b. and 1.
>
> Hi Felix and all,
>
> Here is my suggestion for a note (native speakers please correct):
>
> Bear in mind that ITS is first and foremost a powerful markup technology to
> add metadata to (Web) content. In this sense, it is not a (direct) means to
> support, or even drive process or workflow engines, although some of the
> data categories like provenance, domain, domain mapping, etc. may induce
> such a view. Since this ITS metadata enhances the content in a structured
> way and in multiple forms, ITS consuming agents can employ that data to
> effectively implement their usage or deployment scenarios within single
> engine or single process environments as well as within multi-engine
> environments such as "try MT engine A, then MT engine B, ..." (see also ITS
> 2.0 showcases at
> http://www.w3.org/International/multilingualweb/lt/wiki/Use_cases_-_high_lev
> el_summary#ITS_2.0_Metadata:_Work-In-Context_Showcase).
> It is, however, not possible to assign, say, a specific domain mapping
> incarnation to a certain (process or workflow) instance because such an
> assignment concerns the process side, and this is beyond the current ITS
> metadata scope.
>
> With this, we now have apparently reached consensus on 2.a., 2.b.
> (already reviewed by Christian), and 1. (shepherd's view...)
>
> [@Yves: 1. is independent of the domain mapping specs.]
>
> Cheers -- Jörg
>
> On Jan 29, 2013, at 18:15 (CET), Felix Sasaki wrote:
>> Hi Jan, all,
>>
>> thanks a lot for the initial note, Christian, and for comments in this
>> thread. It seems that Yves made clear that
>>
>> "try MT engine A, then MT engine B"
>>
>> may indeed work with the ITS domain mechanism - but there is a lot of
>> white spaces including
>>
>> "try MT engine A with domain 'financials', then try MT engine B with
>> domain 'healthcare'"
>> and layering of many other processing types. So maybe a final note
>> could concentrate on these white spaces? Anybody volunteering to
>> re-write the note?
>>
>> Best,
>>
>> Felix
>>
>> Am 29.01.13 17:15, schrieb Jan Nelson:
>>> I find it a reasonable practice to define what is not in scope as a
>>> part of any specification, though agree that clear statements of in
>>> scope features are crucial.
>>>
>>> I am curious about how a multi-engine selection/validation process
>>> works.  Christian, you mentioned both TM services as well as MT
>>> engines.  I can see value to be able to call from a set of services
>>> depending on domain with fallback based on result quality scores.
>>> And you state that ITS 2.0 might be a single service scoped spec.
>>>
>>> Yves, you believe that there is support for more than one MT engine
>>> as currently spec'd.  My interest in the white spaces between the two
>>> comments are when layering n-services of differing processing types,
>>> e.g., fuzzy matching TM services versus statistical MT engine results
>>> and how that plays out.  It seems very ambitious to me to provide
>>> scope for this, and yet having a system that is capable of providing
>>> the kinds of metadata needed to enable it would be a pretty powerful
>>> in terms of the potential to provide hi-fi results.
>>>
>>> Maybe my comments are far out of scope, but the thread here caught my
>>> attention.  If this the case, I am happy to discuss it more offline,
>>> perhaps in Rome over a coffee.
>>>
>>> Jan
>>>
>>> ________________________________________
>>> From: Yves Savourel [ysavourel@enlaso.com]
>>> Sent: Tuesday, January 29, 2013 7:55 AM
>>> To: public-multilingualweb-lt-comments@w3.org
>>> Subject: RE: [ISSUE-75] - Domain - 2.a.
>>>
>>> Hi Christian, all,
>>>
>>> I'm always a bit uncomfortable with stating what a mechanism is NOT
>>> doing in a specification. It seems we should be able to define what
>>> it does do and that should be sufficient.
>>>
>>> I would also argue that the scenario "try MT engine A, then MT engine
>>> B" can work perfectly well with what we have today. The specification
>>> provides domainMapping for some basic mappings that allow for example
>>> to point multiple keywords to a more common unique 'domain' label.
>>>
>>> For example you have a mapping as this: domainMapping="'sports law'
>>> Legal, 'property law' Legal"
>>> and two MT engines: they each have a user-defined table that provide
>>> additional re-direction (they are even possibly pair specific: one
>>> maps 'Legal' to 'LEGAL_EN_PT' and the other maps 'Legal' to
>>> '5242e0762354527_legal'.
>>>
>>> Using domainMapping for more than simple grouping is bound to have
>>> quick limitations:
>>>
>>> a) what if you add a third MT engine? You have to edit every single
>>> rules document to add the new mapping?
>>>
>>> b) how do you map to engine that are defined per pair?
>>>
>>> IMO the mapping to the values used to slect the MT engine belongs to
>>> the process side, not the input.
>>>
>>> cheers,
>>> -yves
>>>
>>>
>>>
>>> From: Lieske, Christian [mailto:christian.lieske@sap.com]
>>> Sent: Tuesday, January 29, 2013 8:11 AM
>>> To: public-multilingualweb-lt-comments@w3.org
>>> Subject: [ISSUE-75] - Domain - 2.a.
>>>
>>> Hi,
>>>
>>> One of my comments related to "domain" (see
>>> http://lists.w3.org/Archives/Public/public-multilingualweb-lt-comment
>>> s/2013Jan/0022.html)
>>> was the following:
>>>
>>> 2.a. Domain "systems" may not be harmonized across a processing chain.
>>> A Translation Memory component may for example work with different
>>> domains than a Machine Translation system that is part of the same
>>> processing chain. Since ITS 2.0 "domain" currently does not allow to
>>> capture the information "This is for component X" these scenarios
>>> cannot be addressed.
>>>
>>> During the face-to-face in Prague, we achieved the following status
>>> (see http://www.w3.org/2013/01/23-mlw-lt-minutes.html#item09): a note
>>> should explain that "domain" (and possibly other data categories) do
>>> not accommodate what could be called multi-engine scenario.
>>>
>>> Here is my suggestion for a note ...
>>>
>>> The focus of ITS 2.0, and some of the usage scenarios addressed in
>>> ITS
>>> 2.0 showcases (see
>>> http://www.w3.org/International/multilingualweb/lt/wiki/Use_cases_-_h
>>> igh_level_summary#ITS_2.0_Metadata:_Work-In-Context_Showcase)
>>> is on "single engine" environments. Example: the Machine Translation
>>> (MT) usage scenarios do not work along the lines of process chains
>>> such as "try MT engine A, then MT engine B". Accordingly, ITS 2.0 has
>>> few provisions to support this kind of "multi-engine" environments
>>> which for example require domain-related information such as "try MT
>>> engine A with domain 'financials', then try MT engine B with domain
>>> 'healthcare'".
>>> Cheers,
>>> Christian
>
Received on Tuesday, 5 February 2013 15:52:39 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 9 June 2013 00:25:08 UTC