Re: Expressing complex regions with media fragments - use cases + possible solution from Bernhard Haslhofer on 2010-09-08 (public-media-fragment@w3.org from September 2010)

From: Bernhard Haslhofer <bernhard.haslhofer@univie.ac.at>
Date: Wed, 8 Sep 2010 10:01:29 +0200
To: Jack Jansen <Jack.Jansen@cwi.nl>
Cc: public-media-fragment@w3.org, Robert Sanderson <azaroth42@gmail.com>, Simon Rainer <Rainer.Simon@ait.ac.at>
Message-Id: <8F9D9CF5-1170-4D5B-9A26-2CB6D61172E5@univie.ac.at>
Thanks for your comments, Jack.

In our context, the URI of the ref/URI value pair is not necessarily a dereferencable URL; it could also be a URN/UUID, which identifies the resource that provides further information about the segment. That resource is contained in the RDF document that contains the annotation data.

Even if MF is an open standard, it would be great to have some agreed upon extension point for that standard, such as the "ref" parameter...

Best,
Bernhard

On Sep 7, 2010, at 11:28 PM, Jack Jansen wrote:

> Personal opinion ahead.
> 
> I like the idea of using an indirection mechanism to allow media fragments to use more complex addressing than it's native syntax enables. Actually, I think it is probably the only sane solution: encoding complex shapes (or timing constraints, or whatever else) in a URL is bound to lead to ugly constructions.
> 
> That said, there are a number of problems with your suggestion of standardising the ref= scheme.
> First and foremost, it is an elephant-sized hole for cross-site scripting. Imaging an agent checking the hostname of the base url, finding everything safe, handing it off to its url-library, which then happily proceeds to contact the completely unchecked URL in the ref= parameter. Bad idea...
> There are also implementation issues with the agent not knowing whether it supports the format pointed at by the ref= parameter without retrieving the resource. This could be solved through mimetypes (either retrieved from the server, or explicitly coded as ref=image/svg+xml,url). Actually, this is not only an implementation issue but also a standardisation issue: unless we specify the certain mimetypes for ref= must be accepted it is unclear what adding ref= would add in the first place.
> 
> Note that MF is an open-ended standard: there is nothing that disallows more, extra, parameters. So there is an easy way in which you could experiment with this: you just define a "europeana-extended media fragment", where you state that the ref= parameter must also be supported, etc etc etc (addressing the various issues sketched above). This would allow experience to be gained with the format, so that it could be used as input for a future Media Fragments 2.0.
> 
> On 7 sep 2010, at 17:51, Bernhard Haslhofer wrote:
> 
>> Dear all,
>> 
>> we are currently working on media annotation tools for the European digital library (http://www.europeana.eu), which will allow users to contribute their knowledge to digital items. So far we developed four different tools: one for the annotation of images, one for historic maps (which are a special kind of images), one for video, and one for audio annotation. Demos and screencasts are available at: http://dme.arcs.ac.at/annotation/
>> 
>> In all these tools - except in the audio tool of course - we allow the user to define spatial regions in a media object, which are the segments an annotation is actually about. The possible segment shapes range from simple rectangles, over ellipses to polygons within a single media object. Allowing complex segments was a central requirement coming from the users.
>> 
>> We also want to exchange the created annotations as raw data and have chosen the linked data approach to do that. The question is now how to represent annotation data and the targeted segments within a media object as RDF. So far, the Europeana clients follow the Annotea standard (http://www.w3.org/2001/Annotea/) and represent information about the annotated segment in the fragment identifier part of the media object's URI. We are using the syntax defined by the MPEG-21 standard and introduced our own syntax for complex segments.
>> 
>> For interoperability purposes we would like to implement the W3C Media Fragments specification for our addressing our segments. But at the moment we have the problem that the syntax defined for the spatial dimension is insufficient for our use cases.
>> 
>> Therefore I would like to contribute two use-case scenarios and propose a possible technical solution. It would be great, if the specification could support our use cases in foreseeable time.
>> 
>> Regards,
>> Bernhard
>> 
>> 
>> == Use Cases ==
>> 
>> Use Case: Annotating Media Fragments
>> 
>> - Scenario 1: Spatial Annotation of Historic Maps (Images) with complex region-shapes
>> 
>> Rainer annotates a region in an online historical map using a map annotation tool. He draws a polygon around the geographical area he wants to address with his annotation and starts writing a note about this specific region on the map. The system exposes his annotation as an RDF document, where the annotated map region is identified by a media fragment URI.
>> 
>> - Scenario 2: Spatial and Temporal Annotation of Videos with complex region-shapes
>> 
>> Bernhard selects a video sequence (start- & end-point) in an online video and creates a new annotation for that sequence. He draws an ellipse around a specific region in a frame in order to identify the spatial dimension of his annotation. Then he writes a note for this region. The system exposes his annotation as an RDF document, where the annotated map region is identified by a media fragment URI.
>> 
>> 
>> == Possible Technical Solution ==
>> 
>> Since it is hardly possible to address all possible segment shapes in a fragment identification specification, we propose to introduce a new fragment key/value pair for the spatial dimension, which enables fragment identification by reference. The key could be "ptr", "ref" or something similar and the value a URI. The URI points to a resource, which provides further information about the properties of the spatial region/segment.
>> 
>> For example:
>> 
>> http://www.example.com/map1.jpg#ref=http://www.example.com/region/1 addresses a complex segment (polygon) in a map (image)
>> 
>> http://www.example.com/video1.avi#t=10,20&ref=http://www.example.com/region/2 addresses a complex segment (ellipse) within a video sequence
>> 
>> We propose to leave the interpretation of by-reference fragments to the client. In our annotation use cases this information will typically be delivered as part of the annotation RDF document and the fragment nodes (http://www.example.com/region/1, http://www.example.com/region/2) will have types (e.g., xyz:SVGFragment, xyz:MPEG7Fragment, etc.) assigned that indicate how to correctly interpret the information. If clients do not understand the used fragment identification type, they can still fallback and at least display the annotation for the full media object.
>> 
>> 
>> 
>> 
>> 
>> 
>> ______________________________________________________
>> Research Group Multimedia Information Systems
>> Department of Distributed and Multimedia Systems
>> Faculty of Computer Science
>> University of Vienna
>> 
>> Postal Address: Liebiggasse 4/3-4, 1010 Vienna, Austria
>> Phone: +43 1 42 77 39635 Fax: +43 1 4277 39649
>> E-Mail: bernhard.haslhofer@univie.ac.at
>> WWW: http://www.cs.univie.ac.at/bernhard.haslhofer
>> 
>> 
> 
> --
> Jack Jansen, <Jack.Jansen@cwi.nl>, http://www.cwi.nl/~jack
> If I can't dance I don't want to be part of your revolution -- Emma Goldman
> 
> 
> 

______________________________________________________
Research Group Multimedia Information Systems
Department of Distributed and Multimedia Systems
Faculty of Computer Science
University of Vienna

Postal Address: Liebiggasse 4/3-4, 1010 Vienna, Austria
Phone: +43 1 42 77 39635 Fax: +43 1 4277 39649
E-Mail: bernhard.haslhofer@univie.ac.at
WWW: http://www.cs.univie.ac.at/bernhard.haslhofer
Received on Wednesday, 8 September 2010 08:02:02 UTC