- From: Felix Sasaki <fsasaki@w3.org>
- Date: Sat, 10 Nov 2012 06:37:32 +0100
- To: public-multilingualweb-lt@w3.org
- Message-ID: <CAL58czrGutnR7_gpuQZBB_QhDYe8gLe-J8VAXWHHZjQ4zuz21Q@mail.gmail.com>
are at http://www.w3.org/2012/11/05-mlw-lt-minutes.html and below as text.
Apologies for the delay.
Felix
[1]W3C
[1] http://www.w3.org/
- DRAFT -
MultilingualWeb-LT Working Group Teleconference
05 Nov 2012
See also: [2]IRC log
[2] http://www.w3.org/2012/11/05-mlw-lt-irc
Attendees
Present
Pedro, Marcis, daveL, Jirka, DomJones, leroy, Yves_,
Ankit, tadej, omstefanov, kfritsche, Arle, fantasai,
shaunm, marcis, naoto, milan
Regrets
davidF, Felix
Chair
dave
Scribe
daveL
Contents
* [3]Topics
1. [4]agenda
2. [5]Standoff markup
3. [6]its-tools
* [7]Summary of Action Items
__________________________________________________________
agenda
[8]http://lists.w3.org/Archives/Public/public-multilingualweb-l
t/2012Nov/0024.html
[8] http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2012Nov/0024.html
[9]http://lists.w3.org/Archives/Public/public-multilingualweb-l
t/2012Nov/0026.html
[9] http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2012Nov/0026.html
topic; Doodle poll about virtual f2f
<tadej> [10]http://doodle.com/heh7k59h7vkvnv88#table
[10] http://doodle.com/heh7k59h7vkvnv88#table
<tadej> daveL: poll shows 27th and 28th to be both good
candidates
<tadej> ... I would suggest taking the 27th and 28th, having
both around 3 hour calls in the afternoon
<tadej> ... howerver, we should deal with more specific issues
beforehand
<tadej> daveL: Tuesday, Nov 20th is also a good candidate
<tadej> ACTION: daveL to confirm November 20, 27 and 28 as
virtual session dates [recorded in
[11]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action01]
<trackbot> Created ACTION-278 - Confirm November 20, 27 and 28
as virtual session dates [on David Lewis - due 2012-11-12].
topic; upcoming meetings
[12]http://www.w3.org/International/multilingualweb/lt/wiki/Mai
n_Page#Upcoming
[12] http://www.w3.org/International/multilingualweb/lt/wiki/Main_Page#Upcoming
<tadej> daveL: checking if the schedule makes sense - so far
Prague 23-24 Jan, Rome 12-13 March, Bled 7-8 May, and Madrid
still unspecified
<tadej> daveL: as for events, there's a GALA event, LocWorld,
the WWW conference in Rio, and the LRC conference in Limerick
<tadej> Yves_: the only thing we need to fix is the dates for
the Madrid meeting, since July is a holiday month
<Arle> We may be able to get on the GALA program. I will know
more soon.
<tadej> Pedro: For July, the sooner the better, ideally first
week
<tadej> ... or even last week of June
<tadej> ACTION: daveL to open doodle poll for Madrid dates (end
June - beginning July) [recorded in
[13]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action02]
<trackbot> Created ACTION-279 - Open doodle poll for Madrid
dates (end June - beginning July) [on David Lewis - due
2012-11-12].
<Arle> (Separate from what Pedro has already submitted, which
is a great start.)
Standoff markup
topic; standoff markup
[14]http://lists.w3.org/Archives/Public/public-multilingualweb-
lt/2012Nov/0019.html
[14] http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2012Nov/0019.html
<tadej> Yves_: we should use a single root element, like
its:standOffList (or similarly named). the inclusion mechanism
would be via the script element, either inline or separate file
<tadej> ...given the example, it would be better to split the
standoff into two separate <script>-s, and have the script
element id match the standoff list ids.
<tadej> Pedro: the external files can be problematic in cases
with real-time translation
<tadej> daveL: do you think the its:rules elements could be the
enclosing element?
<tadej> Yves_: since we need to point to multiple
its:standofflists, they can't be the root element, since they
could exist in the same file; its:rules could be a root.
<tadej> daveL: could you correct the schema so it takes this
into account?
<tadej> Yves_: mixing rules and standoff can get messy
<tadej> daveL: its:rules is easy from the conformance point of
view, easier to explain, although there may be confusion
<tadej> Jirka: there's conceptual overload with this - we'd be
declaring its:rules, and it wouldn't contain actual rules, but
standoff info
<tadej> daveL: let's summarize having a single element
its:standoffList having an id attribute which matches the
script element's id.
<tadej> ... in external files, we could have multiple standoff
lists
<tadej> ACTION: Yves_ to edit the spec to unique standoff
markup [recorded in
[15]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action03]
<trackbot> Sorry, couldn't find Yves_. You can review and
register nicknames at
<[16]http://www.w3.org/International/multilingualweb/lt/track/u
sers>.
[16] http://www.w3.org/International/multilingualweb/lt/track/users%3E.
<tadej> ACTION: Yves to edit the spec to unique standoff markup
[recorded in
[17]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action04]
<trackbot> Created ACTION-280 - Edit the spec to unique
standoff markup [on Yves Savourel - due 2012-11-12].
its-tools
[18]http://lists.w3.org/Archives/Public/public-multilingualweb-
lt/2012Nov/0004.html
[18] http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2012Nov/0004.html
<tadej> daveL: Marcis sent an update consolidating MT
confidence and TA Annotation into simpler definitions
<tadej> ... there's still an open issue on whether defining
its:tools should be compulsory for these two data categories.
any opinions?
<tadej> Yves_: sounds reasonable
<tadej> daveL: I'll modify the text and make it compulsory.
<tadej> daveL: Marcis also pointed out that several tools could
process a fragment of text, which makes things confusing. it's
different than MT, since you're annotating an annotation.
<tadej> ... should we then just apply the its:tool to those
data categories than have it as a separate data category?
<tadej> tadej: disambiguation could survive that, it's
equivalent
[19]http://lists.w3.org/Archives/Public/public-multilingualweb-
lt/2012Nov/0006.html
[19] http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2012Nov/0006.html
<scribe> scribe: daveL
tadej: is currently updating its-tools, looking at use of
non-its annotations
<tadej> daveL: right now we have a mechanism to identify to
which data category it applies to, allowing for user-defined
names
<tadej> daveL: ... since you're borrowing the mechanism anyway,
you're out of conformance anyway
<tadej> daveL: we could remove it, since we don't have a formal
extension mechanism
<Marcis> I hear you, I just cannot say anything
<tadej> tadej: if we define a per-datacategory confidence
attribute, how to express multi-valued attributes?
<Marcis> I mean, if the domains are automatically identified,
then you will have a confidence (if the systems will return
probabilistic results)
<Marcis> As tadej said - the weighted mechanism says that there
is a confidence
<tadej> tadej: It boils down to whether that number is useful
for the consumer
<Marcis> The categories (not in exact names...) that I see
requiring the confidence are: MT, Terminology, Domain
segmentation tools (are there any currently used by the MT use
cases?), Named Entity Recognition (currently in Disambiguation,
right?), others (?)
<tadej> ACTION: daveL to ask for use cases of data
category-specific confidence scores [recorded in
[20]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action05]
<trackbot> Created ACTION-281 - Ask for use cases of data
category-specific confidence scores [on David Lewis - due
2012-11-12].
<Ankit> w.r.t. confidence scores in MT, they are are mainly
used in a post-editing environment, i.e. when a human
translator uses these scores to determine which outputs of a MT
system they want to correct..
<tadej> tadej: disambiguation can produce scores, but not
commonly used
<tadej> daveL: its:tools has its own element, the
its:standOffList - we should describe it how it works within a
script element, so it's as similar as possible to the XML
markup.
Summary of Action Items
[NEW] ACTION: daveL to ask for use cases of data
category-specific confidence scores [recorded in
[21]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action05]
[NEW] ACTION: daveL to confirm November 20, 27 and 28 as
virtual session dates [recorded in
[22]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action01]
[NEW] ACTION: daveL to open doodle poll for Madrid dates (end
June - beginning July) [recorded in
[23]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action02]
[NEW] ACTION: Yves to edit the spec to unique standoff markup
[recorded in
[24]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action04]
[NEW] ACTION: Yves_ to edit the spec to unique standoff markup
[recorded in
[25]http://www.w3.org/2012/11/05-mlw-lt-minutes.html#action03]
[End of minutes]
__________________________________________________________
Minutes formatted by David Booth's [26]scribe.perl version
1.137 ([27]CVS log)
$Date: 2012/11/10 05:36:17 $
[26] http://dev.w3.org/cvsweb/~checkout~/2002/scribe/scribedoc.htm
[27] http://dev.w3.org/cvsweb/2002/scribe/
Received on Saturday, 10 November 2012 05:37:56 UTC