- From: erik mannens <erik.mannens@ugent.be>
- Date: Fri, 9 May 2008 16:08:28 +0200
- To: <public-video-comments@w3.org>
- Message-ID: <00cb01c8b1de$275de4a0$7619ade0$@mannens@ugent.be>
Hi all, Below you'll find some comments of MMLab regarding the "Media Fragments Working Group": "The Group will focus on developing a mechanism to uniquely identify a temporal fragment within an audio or video object, that is independent of the underlying audio or video codec in use" Pointing to a media fragment by means of a particular time scheme is indeed independent of the coding format and will be based on existing solutions such as MPEG-21 URI or temporal URI. However, since most media available on the web is encoded using a specific coding format, delivering the fragments as identified by the URI is not trivial. Two possibilities exist: . The full media fragment is sent to the client. Since the client has a decoder at its disposal, the client has knowledge about the coding format used. Hence, the media fragment can be partially decoded (i.e., decode only the parts pointed out by the URI). One of the biggest problems with this approach is bandwidth usage. The client does not want to download the full media fragment if he/she only wants to see specific segments. Note that this is especially true for audio/video fragments; for images, this problem could be less relevant (i.e., extract a specific region of the image on client side) . Adapt the full media fragment on the server and send only the necessary fragments to the client. This solves the bandwidth problem but introduces a new problem: a web server does not have knowledge about the underlying coding formats (used to encode the media fragments located on the server). Hence, adaptation logic needs to be present on the server in order to correctly deliver the requested content (via the URI) to the client. Note that in case of using container formats such as MP4, things could go easier since container formats are capable of keeping timestamps associated with the compressed streams. Based on these timestamps, the relevant fragments could be extracted. Next to the requirement of adaptation logic, there is another problem regarding the delivery of the requested URI. Compressing media fragments usually introduces dependencies in the compressed bitstreams. For example, pictures in a video fragment are predicted based on previous (and/or future) pictures. Hence, random access to a compressed bitstream (especially in case of video) is not trivial and requires the availability of random access points (if not, decoding-encoding operations are needed). Furthermore, the random access points in a compressed video fragment won't match (in most cases) with the timestamps specified by the URI. Therefore, delivering partial media fragments of compressed bitstreams will not match exactly with the timestamps specified in the URI. Note that container formats provide solutions for this problem by enabling the possibility to indicate that particular frames are not allowed to be displayed. Sincere greetings, Erik Mannens & Davy Van Deursen Gaston Crommenlaan 8 bus 201 B-9050 Ledeberg-Ghent, Belgium T: +32 9 331 49 93 F: +32 9 331 48 96 M: +32 473 27 44 17 http://multimedialab.elis.ugent.be
Received on Friday, 9 May 2008 14:09:04 UTC