The clipBegin attribute specifies the beginning of a sub-clip of a continuous media object as offset from the start of the media object. This offset is measured in normal media playback time from the beginning of the media.
CASE1:
<ref src="media1" clipBegin="smpte=01:02:00:00" clipEnd="smpte=01:06:00:00"/>
media1 |---------+++++++++++++++++++++---------| 8-minute-long
01:00:.. 01:02:00:00 01:06:.. 01:08:00:00 embedded TCs in media1
ref |+++++++++++++++++++| 4-minute-long
00:00:00:00 00:04:00:00
CASE2:
<ref src="media2" clipBegin="smpte=00:02:00:00" clipEnd="smpte=00:06:00:00"/>
media2 empty // |-------------------| 8-minute-long
00:02:.. 00:06:.. // 01:00:00:00 01:08:00:00 embedded TCs in media2
ref |---------| // 4-minute-long
00:00:.. 00:04:.. //
All of these media elements are semantically identical. When playing back a media object, the player must not derive the exact type of the media object from the name of the media object element.
<par> <video src="media1"/> <audio src="media2"/> </par>
by XPointer-like URI fragment:
<audio src="media1#track('A1')" cast="left"/>
by attribute:
<audio src="media1" track="A1" cast="left"/>
<video src="media2" coords="0%,0%,50%,100%"/> <seq coords="50%,0%,100%,100%"> <video src="media1"/> <video src="media2"/> </seq>
<par alpha="0.5"> <video src="media1" alpha="0.8"/> <video src="media2" alpha="0.2"/> </par>
<ref src="media1" begin="smpte=01:00:00:00" end="smpte=01:05:00:00"/>
media1 |+++++++++++++++++++|
03:00:00:00 03:04:00:00 embedded TCs in media1
01:00:00:00 01:04:00:00 applied TCs to ref element
<seq begin="smpte=01:00:00:00">
<ref src="media1"/>
<ref src="media2"/>
</seq>
media1 |+++++++++++++++++++|
03:00:00:00 03:04:00:00 embedded TCs in media1
media2 |+++++++++++++++++++|
02:00:00:00 02:04:00:00 embedded TCs in media2
seq |+++++++++++++++++++++++++++++++++++++++|
01:00:00:00 01:04:00:00 01:08:00:00 applied TCs to seq element
<ref src="media0" clipBegin="smpte=01:02:00:00" clipEnd="smpte=01:06:00:00"/>
Given media0 as a par element below:
<par>
<ref src="media1" begin="smpte=01:00:00:00"/>
<ref src="media2" begin="smpte=01:03:00:00"/>
</par>
to be equivalent to a par element below:
<par clipBegin="smpte=01:02:00:00" clipEnd="smpte=01:06:00:00">
<ref src="media1" begin="smpte=01:00:00:00"/>
<ref src="media2" begin="smpte=01:03:00:00"/>
</par>
<head>
<metadata type="text/xml">
<mp7:mpeg7 xmlns:mp7="urn:mpeg:mpeg7:...">
....
</mp7:mpeg7>
</metadata>
<metadata type="application/rdf+xml">
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
....
</rdf:RDF>
</metadata>
</head>
Why not ? <par transOut="crossfade"> <ref src="media1"/> <ref src="media2"/> </par> rather than <par> <ref src="media1" transOut="crossfade"/> <ref src="media2" transOut="crossfade"/> </par>