Re: Semantic media retrieval UC : an update

From: Ioannis Pratikakis <ipratika@iit.demokritos.gr>
Date: Thu, 23 Nov 2006 13:32:25 +0200
Message-ID: <005b01c70ef3$0c79d020$5603a8c0@iit.demokritos.gr>
To: "Raphael Troncy" <Raphael.Troncy@cwi.nl>
Cc: <public-xg-mmsem@w3.org>

Dear Raphael,

The update of the UC on Semantic media retrieval has been done
by taking into consideration your comments expressed in

Below you will find how those comments have been addressed in the update.

--- Comment 1 ---
1) I found your examples, not enough "example" ! You still keep a very general and sometimes "vague" level of discourse. To help you, for instance in the Example 3, give us a web page with a picture and its caption. Tell us what kind of information some text analysis techniques could give you. Then tell us how your face analysis will use this information as input to better detect the person of this web page ... In other words, be concrete :-) 

I have tried to provide a higher degree of detail in the description of my examples.
Additionally, I have included figures where appropriate
(e.g., see Fig. 1).

--- Comment 2 ---
 2) Before your motivating examples, you might first discuss the problems you would like to tackle. It seems to me that your concerns are: 
        . How to better do cross-modality analysis, and better exchange the results of each single modality analysis ? see Example 3 
        . How to include some fuzziness in the representation of the analysis results (some degree of confidence), and how to merge this fuzzy information with the true/false knowledge of an ontology ? see Example 2 
        . How to add semantics to the representation of low-level descriptors so they become more exchangeable ? see Example 1 

In the new version, I discarded Example 2 and tried to develop Examples 1 and 3 further.
For clarity, the main problem to be tackled is given as the title of each Example.

--- Comment 3 ---
3) I don't get exactly what you mean in your Example 1. When you say that "To enable a semantic interoperability it is not adequate to permit the exchange of low-level features between different users", would you mean, it is not wishable ? or do just remark that in the current situation, MPEG-7 does not allow such an exchange because of its lack of formal semantics ? And what do you suggest for solving this issue: provide a formal semantics to these low-level MPEG-7 descriptors ? Or simply do not exchange this information ? ... 
I don't get after the problems with parts of images. Could you clarify this point ?

Regarding your first question, I mean that there is a lack of formal semantics.
My suggestion for this problem is stated in the updated Solution to Example 1.

'The problems with parts of images' referred to a technical problem with low-level feature computation.
There is no need for it to appear in the updated text, so it has been discarded.


Again, I would like to thank you for your comments.
Don't hesitate to contact me with any further questions.

With my best regards,


