CVS html5/html-sourcing-inband-tracks

Update of /sources/public/html5/html-sourcing-inband-tracks
In directory roscoe:/tmp/cvs-serv7310

Modified Files:
	Overview.html 
Log Message:
Sync HTML sourcing inband tracks spec with Git commit 1895989d34dc9acf46df8df5737f89431d631509

--- /sources/public/html5/html-sourcing-inband-tracks/Overview.html	2014/06/07 08:07:00	1.4
+++ /sources/public/html5/html-sourcing-inband-tracks/Overview.html	2014/08/03 12:21:54	1.5
@@ -64,7 +64,7 @@
 }
 
 code {
-    color:  #ff4500;
+    color:  #C83500;
 }
 
 /* --- TOC --- */
@@ -182,7 +182,7 @@
   </p>
   <h1 class="title p-name" id="title" property="dcterms:title">Sourcing In-band Media Resource Tracks from Media Containers into HTML</h1>
   
-  <h2 property="dcterms:issued" datatype="xsd:dateTime" content="2014-06-06T22:03:42.000Z" id="unofficial-draft-07-june-2014">Unofficial Draft <time class="dt-published" datetime="2014-06-07">07 June 2014</time></h2>
+  <h2 property="dcterms:issued" datatype="xsd:dateTime" content="2014-08-03T00:08:01.000Z" id="unofficial-draft-03-august-2014">Unofficial Draft <time class="dt-published" datetime="2014-08-03">03 August 2014</time></h2>
   <dl>
     
     
@@ -265,14 +265,14 @@
       </p>
       <p>
         For every MIME-type/subtype of an existing media container format, this specification defines the following information:
-        </p><ol>
-          <li>Track order.</li>
-          <li>How to identify the type of tracks.</li>
-          <li>Setting track attributes 'id', 'kind', 'language' and 'label' for sourced text tracks.</li>
-          <li>Setting track attributes 'id', 'kind', 'language' and 'label' for sourced audio and video tracks.</li>
-          <li>Mapping text track content into text track cues.</li>
-        </ol>
-      <p></p>
+      </p>
+      <ol>
+        <li>Track order.</li>
+        <li>How to identify the type of tracks.</li>
+        <li>Setting track attributes 'id', 'kind', 'language' and 'label' for sourced text tracks.</li>
+        <li>Setting track attributes 'id', 'kind', 'language' and 'label' for sourced audio and video tracks.</li>
+        <li>Mapping text track content into text track cues.</li>
+      </ol>
     </section>
 
     <section id="mpegdash" typeof="bibo:Chapter" resource="#mpegdash" rel="bibo:Chapter">
@@ -280,26 +280,25 @@
       <b>MIME type/subtype: application/dash+xml</b>
 
       <ol>
-        <li>Track Order
+        <li><p>Track Order</p>
           <p>
             The order of tracks specified in the MPD (Media Presentation Description) format [<cite><a class="bibref" href="#bib-MPEGDASH">MPEGDASH</a></cite>] is maintained when sourcing multiple MPEG DASH tracks into HTML.
           </p>
         </li>
 
-        <li>Determining the type of track
+        <li><p>Determining the type of track</p>
           <p>
-            A user agent recognises and supports data from a MPEG DASH media resource as being equivalent to a HTML track based on the adaptationSet mimeType:
-            </p><ul>
-              <li>text track: the mimeType is of main type "application" or "text"</li>
-              <li>video track: the mimeType is of main type "video"</li>
-              <li>audio track: the mimeType is of main type "audio"</li>
-            </ul>
-          <p></p>
+            A user agent recognises and supports data from a MPEG DASH media resource as being equivalent to a HTML track based on the AdaptationSet or ContentComponent mimeType:
+          </p>
+          <ul>
+            <li>text track: the mimeType is of main type "application" or "text"</li>
+            <li>video track: the mimeType is of main type "video"</li>
+            <li>audio track: the mimeType is of main type "audio"</li>
+          </ul>
         </li>
 
-        <li>Track Attributes for sourced Text Tracks
-          <p>
-          </p><table>
+        <li><p>Track Attributes for sourced Text Tracks</p>
+          <table>
             <thead>
               <tr><th>Attribute</th>
               <th>How to source its value</th>
@@ -307,7 +306,7 @@
             <tbody><tr>
               <th>id</th>
               <td>
-                Content of the 'id' attribute in the AdaptationSet element. Empty string if 'id' attribute is not present.
+                Content of the 'id' attribute in the AdaptationSet or ContentComponent element. Empty string if 'id' attribute is not present.
               </td>
             </tr>
             <tr>
@@ -330,16 +329,14 @@
             <tr>
               <th>language</th>
               <td>
-                Content of the 'lang' attribute in the AdaptationSet element.
+                Content of the 'lang' attribute in the AdaptationSet or ContentComponent element.
               </td>
             </tr>
           </tbody></table>
-          <p></p>
         </li>
 
-        <li>Track Attributes for sourced Audio and Video Tracks
-          <p>
-          </p><table>
+        <li><p>Track Attributes for sourced Audio and Video Tracks</p>
+          <table>
             <thead>
               <tr><th>Attribute</th>
               <th>How to source its value</th>
@@ -347,7 +344,7 @@
             <tbody><tr>
               <th>id</th>
               <td>
-                Content of the 'id' attribute in the AdaptationSet element. Empty string if 'id' attribute is not present.
+                Content of the 'id' attribute in the AdaptationSet or ContentComponent element. Empty string if 'id' attribute is not present.
               </td>
             </tr>
             <tr>
@@ -377,15 +374,57 @@
             <tr>
               <th>language</th>
               <td>
-                Content of the 'lang' attribute in the AdaptationSet element.
+                Content of the 'lang' attribute in the AdaptationSet or ContentComponent element.
               </td>
             </tr>
           </tbody></table>
-          <p></p>
         </li>
 
-        <li>Mapping text track content into text track cues
-          <p>TBD</p>
+        <li><p>Mapping text track content into text track cues</p>
+          <p>
+            DASH text track data may be contained in files or encapsulated in MPEG media container formats ISOBMFF and MPEG-2 TS. DASH text track data contained in a file may be in the WebVTT or TTML format [<cite><a class="bibref" href="#bib-ISO14496-30">ISO14496-30</a></cite>], 3GPP Timed Text format [<cite><a class="bibref" href="#bib-3GPP-TT">3GPP-TT</a></cite>], or other format. Encapsulated DASH text track data may be in the format defined in this specification for <a href="#ISOBMFF-TT">ISOBMFF text track</a> or <a href="#MPEG2TS-TT">MPEG-2 TS text track</a>. 
+          </p>
+          <p>
+            DASH text track data is in a file in the WebVTT format if the file MIME type is "text/vtt" and should be exposed as a VTTCue as defined in [<cite><a class="bibref" href="#bib-WEBVTT">WEBVTT</a></cite>]. DASH text track data is in a file in the TTML format if the file MIME type is "application/ttml+xml" and should be exposed as a TTMLCue, which is yet to be defined. Alternatively, browsers can map the TTML features to VTTCue objects. Finally, browsers that cannot render TTML [<cite><a class="bibref" href="#bib-ttaf1-dfxp">ttaf1-dfxp</a></cite>] format data should expose it as DataCue objects [<cite><a class="bibref" href="#bib-HTML5">HTML5</a></cite>]. In this case, the TTML file must be parsed in its entirety and then converted into a sequence of TTML Intermediate Synchronic Documents (ISDs). Each ISD creates a DataCue object with attributes sourced as follows: 
+            </p><p>
+              </p><table>
+                <thead>
+                  <tr><th>Attribute</th>
+                  <th>How to source its value</th>
+                </tr></thead>
+                <tbody><tr>
+                  <th>id</th>
+                  <td>Decimal representation of the ‘id’ attribute of the ‘head’ element in the XML document. Null if there is no ‘id’ attribute.</td>
+                </tr>
+                <tr>
+                  <th>startTime</th>
+                  <td>
+                    Value of the beginning media time of the active temporal interval of the ISD.
+                  </td>
+                </tr>
+                <tr>
+                  <th>endTime</th>
+                  <td>
+                    Value of the ending media time of the active temporal interval of the ISD.
+                  </td>
+                </tr>
+                <tr>
+                  <th>pauseOnExit</th>
+                  <td>"false"</td>
+                </tr>
+                <tr>
+                  <th>text</th>
+                  <td>The (UTF-16 encoded) character array composing the ISD resource.</td>
+                </tr>
+              </tbody></table>
+            <p></p>
+          <p></p>
+          <p>
+            DASH text track data is encapsulated in an ISOBMFF container format if the MIME type is "application/mp4" (ISOBMFF container with one track) or "video/mp4" (ISOBMFF container with multiple tracks). Text track data in this format should be exposed following the same rules as for <a href="#ISOBMFF-TT">ISOBMFF text track</a>.
+          </p>
+          <p>
+            DASH text track data is encapsulated in an MPEG-2 TS container format if the MIME type is "video/mp2t" (MPEG-2 TS container with multiple elementary streams). Text track data in this format should be exposed following the same rules as for <a href="#MPEG2TS-TT">MPEG-2 TS text track</a>.
+          </p>
         </li>
       </ol>
 
@@ -396,30 +435,29 @@
       <b>MIME type/subtype: audio/mp2t , video/mp2t</b>
 
       <ol>
-        <li>Track Order
+        <li><p>Track Order</p>
           <p>
             Tracks are called "elementary streams" in a MPEG-2 Transport Stream (TS) [<cite><a class="bibref" href="#bib-MPEG2TS">MPEG2TS</a></cite>]. The order in which elementary streams are listed in the "Program Map Table" (PMT) of a MPEG-2 TS is maintained when sourcing multiple MPEG-2 tracks into HTML.
           </p>
         </li>
 
-        <li>Determining the type of track
+        <li><p>Determining the type of track</p>
           <p>
             A user agent recognises and supports data from a MPEG-2 TS resource as being equivalent to a HTML track based on the value of the 'stream_id' field of an elementary stream as given in a Transport or Program Stream header and which maps to a "stream type":
-            </p><ul>
-              <li>text track:
-                <ul>
-                  <li>The elementary stream with PID 0x02 or the stream type value is "0x02", "0x05" or between "0x80" and "0xFF". </li>
-                  <li>The CEA 708 caption service [<cite><a class="bibref" href="#bib-CEA708">CEA708</a></cite>] in the 'Picture User Data' of a video stream, as identified by a 'Caption Service Descriptor' [<cite><a class="bibref" href="#bib-ATSC65">ATSC65</a></cite>] in the 'Elementary Stream Descriptors' in the PMT entry for a video stream with stream type 0x02.</li>
-                </ul>
-              </li><li>video track: the stream type value is "0x01", "0x02", "0x10", "0x1B", or between "0x1E" and "0x23"</li>
-              <li>audio track: the stream type value is "0x03", "0x04", "0x0F", "0x11", or "0x1C"</li>
-            </ul>
-          <p></p>
+          </p>
+          <ul>
+            <li>text track:
+              <ul>
+                <li>The elementary stream with PID 0x02 or the stream type value is "0x02", "0x05" or between "0x80" and "0xFF". </li>
+                <li><dfn id="captionservice">The CEA 708 caption service</dfn> [<cite><a class="bibref" href="#bib-CEA708">CEA708</a></cite>], as identified by a 'Caption Service Descriptor' [<cite><a class="bibref" href="#bib-ATSC65">ATSC65</a></cite>] in the 'Elementary Stream Descriptors' in the PMT entry for a video stream with stream type 0x02 or 0x1B.</li>
+              </ul>
+            </li><li>video track: the stream type value is "0x01", "0x02", "0x10", "0x1B", or between "0x1E" and "0x23"</li>
+            <li>audio track: the stream type value is "0x03", "0x04", "0x0F", "0x11", or "0x1C"</li>
+          </ul>
         </li>
 
-        <li>Track Attributes for sourced Text Tracks
-          <p>
-          </p><table>
+        <li><p>Track Attributes for sourced Text Tracks</p>
+          <table>
             <thead>
               <tr><th>Attribute</th>
               <th>How to source its value</th>
@@ -446,26 +484,24 @@
             <tr>
               <th>label</th>
               <td>
-                The empty string.
+                The DOMString representation of the 'component_name_string' field in the 'component_name_descriptor' [<cite><a class="bibref" href="#bib-ATSC65">ATSC65</a></cite>] found immediately after the 'ES_info_length' field in the Program Map Table [<cite><a class="bibref" href="#bib-MPEG2TS">MPEG2TS</a></cite>]. The empty string if the 'component_name_descriptor' is not present.
               </td>
             </tr>
             <tr>
               <th>language</th>
               <td>@kind is
                 <ul>
-                  <li>'captions': Content of the 'language' field for the caption service in the 'Caption Service Descriptor'.</li>
-                  <li>'subtitles': Content of the 'ISO_639_language_descriptor' in the elementary stream descriptor array in the PMT.</li>
-                  <li>'metadata': The empty string.</li>
+                  <li>"captions": Content of the 'language' field for the caption service in the 'Caption Service Descriptor'.</li>
+                  <li>"subtitles": Content of the 'ISO_639_language_code' field in the 'ISO_639_language_descriptor' in the elementary stream descriptor array in the PMT.</li>
+                  <li>"metadata": The empty string.</li>
                 </ul>
               </td>
             </tr>
           </tbody></table>
-          <p></p>
         </li>
 
-        <li>Track Attributes for sourced Audio and Video Tracks
-          <p>
-          </p><table>
+        <li><p>Track Attributes for sourced Audio and Video Tracks</p>
+          <table>
             <thead>
               <tr><th>Attribute</th>
               <th>How to source its value</th>
@@ -482,7 +518,7 @@
                 <ul>
                   <li>"alternative": not used</li>
                   <li>"captions": not used</li>
-                  <li>"descriptions": AC3 audio in MPEG-2 TS: bsmod=2 and full_svc=0</li><!-- see http://www.atsc.org/cms/pdf/bootcamp/PSIP_Captions_rev2.pdf -->
+                  <li>"descriptions": AC3 audio in MPEG-2 TS [<cite><a class="bibref" href="#bib-ATSC52">ATSC52</a></cite>]: bsmod=2 and full_svc=0</li><!-- see http://www.atsc.org/cms/pdf/bootcamp/PSIP_Captions_rev2.pdf -->
                   <li>"main": first audio (video) elementary stream in the PMT</li>
                   <li>"main-desc": AC3 audio in MPEG-2 TS: bsmod=2 and full_svc=1</li>
                   <li>"sign": not used</li>
@@ -496,158 +532,142 @@
             <tr>
               <th>label</th>
               <td>
-                The empty string.
+                The DOMString representation of the 'component_name_string' field in the 'component_name_descriptor' [<cite><a class="bibref" href="#bib-ATSC65">ATSC65</a></cite>] found immediately after the 'ES_info_length' field in the Program Map Table [<cite><a class="bibref" href="#bib-MPEG2TS">MPEG2TS</a></cite>]. The empty string if the 'component_name_descriptor' is not present.
               </td>
             </tr>
             <tr>
               <th>language</th>
-              <td>
-                Content of the 'ISO_639_language_descriptor' field.
+              <td>@kind is:
+                <ul>
+                  <li> "descriptions" or "main-desc": Content of the 'language' field in the 'AC-3_audio_stream_descriptor' [<cite><a class="bibref" href="#bib-ATSC52">ATSC52</a></cite>].</li>
+                  <li> otherwise: Content of the 'ISO_639_language_code' field in the 'ISO_639_language_descriptor'.</li>
+                </ul>
               </td>
             </tr>
           </tbody></table>
-          <p></p>
         </li>
-
-        <li>Mapping text track content into text track cues
+        <li><p><dfn id="MPEG2TS-TT">Mapping text track content into text track cues for MPEG-2 TS</dfn></p>
           <p>
             MPEG-2 transport streams may contain data that should be exposed as cues on 'captions', 'subtitles' or 'metadata' text tracks. No data is defined that equates to 'descriptions' or 'chapters' text track cues.
           </p>
-          <p>
-            </p><ol type="a">
+          <ol type="a">
+            <li><p>Metadata cues</p>
               <p>
-              </p><li>Metadata cues
-                <p>
-                  Cues on an MPEG-2 metadata text track are created as DataCue objects [<cite><a class="bibref" href="#bib-HTML5">HTML5</a></cite>]. Each 'section' in an elementary stream identified as a text track creates a DataCue object with its TextTrackCue attributes sourced as follows:
-                </p>
-                <table>
-                  <thead>
-                    <tr><th>Attribute</th>
-                    <th>How to source its value</th>
-                  </tr></thead>
-                  <tbody><tr>
-                    <th>id</th>
-                    <td>
-                      Decimal representation of the 'table_id' in the first 8 bits of the 'section' data.
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>startTime</th>
-                    <td>
-                      0
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>endTime</th>
-                    <td>
-                      The time, in the media resource timeline, that corresponds to the presentation time of the video frame received immediately prior to the 'section' in the media resource.
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>pauseOnExit</th>
-                    <td>
-                      'false'
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>data</th>
-                    <td>
-                      The 'section_length' number of bytes immediately following the 'section_length' field in the 'section'.
-                    </td>
-                  </tr>
-                </tbody></table>
-              </li>
-              <p></p>
-              <p>
-              </p><li>Captions cues
-                <p>
-                  Cues on an MPEG-2 captions text track are created as DataCue objects [<cite><a class="bibref" href="#bib-HTML5">HTML5</a></cite>]. MPEG-2 captions data is in the CEA 708 format [<cite><a class="bibref" href="#bib-CEA708">CEA708</a></cite>]. Each 'service block' in an digital TV closed caption (DTVCC) transport channel creates a DataCue object with TexTrackCue attributes sourced as follows:
-                </p>
-                <table>
-                  <thead>
-                    <tr><th>Attribute</th>
-                    <th>How to source its value</th>
-                  </tr></thead>
-                  <tbody><tr>
-                    <th>id</th>
-                    <td>
-                      Decimal representation of the 'service_number' in the 'service_block'.
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>startTime</th>
-                    <td>
-                      The time, in the HTML media resource timeline, that corresponds to the presentation time stamp for the video frame that contained the first 'Caption Channel Data Byte' of the 'service_block'. 
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>endTime</th>
-                    <td>
-                      The sum of the startTime and 4 seconds.
-                      <div class="note"><div class="note-title" aria-level="2" role="heading" id="h_note_4"><span>Note</span></div><p class="">
-                        CEA 708 captions do not have an explicit end time - a rendering device derives the end time for a caption based on subsequent caption data. Setting endTime equal to startTime might be more appropriate but this would require better support for zero-length cues, as proposed in <a href="https://www.w3.org/Bugs/Public/show_bug.cgi?id=25693">HTML Bug 25693</a>.
-                      </p></div>
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>pauseOnExit</th>
-                    <td>
-                      'false'
-                    </td>
-                  </tr>
-                  <tr>
-                    <th>data</th>
-                    <td>
-                      The 'service_block'.
-                    </td>
-                  </tr>
-                </tbody></table>
+                Cues on an MPEG-2 metadata text track are created as DataCue objects [<cite><a class="bibref" href="#bib-HTML5">HTML5</a></cite>]. Each 'section' in an elementary stream identified as a text track creates a DataCue object with its TextTrackCue attributes sourced as follows:
+              </p>
+              <table>
+                <thead>
+                  <tr><th>Attribute</th>
+                  <th>How to source its value</th>
+                </tr></thead>
+                <tbody><tr>
+                  <th>id</th>
+                  <td>
+                    Decimal representation of the 'table_id' in the first 8 bits of the 'section' data.
+                  </td>
+                </tr>

[514 lines skipped]

Received on Sunday, 3 August 2014 12:21:56 UTC
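
Illustrative note on the MPEG DASH rules in the diff above: the sketch below shows one way the mimeType-based track classification and the MIME-type dispatch for text track data might be expressed. It is a minimal sketch, not part of the commit or the specification. The function names, the returned labels, and the lookup table are hypothetical and exist only for illustration; only the MIME types and the main-type rules are taken from the text.

```python
from typing import Optional


def track_type_from_mime(mime_type: str) -> Optional[str]:
    """Classify an AdaptationSet/ContentComponent mimeType by its main type.

    Follows the list in the diff: main type "application" or "text" is a
    text track, "video" is a video track, "audio" is an audio track.
    Returns None for any other main type (an assumption; the spec text
    does not say what happens in that case).
    """
    main_type = mime_type.split("/", 1)[0].strip().lower()
    if main_type in ("application", "text"):
        return "text"
    if main_type in ("video", "audio"):
        return main_type
    return None


# Hypothetical labels summarising which rules apply to DASH text track
# data once a track has already been identified as a text track.
TEXT_DATA_RULES = {
    "text/vtt": "VTTCue",
    "application/ttml+xml": "TTMLCue, VTTCue mapping, or one DataCue per ISD",
    "application/mp4": "ISOBMFF text track rules",
    "video/mp4": "ISOBMFF text track rules",
    "video/mp2t": "MPEG-2 TS text track rules",
}


def text_cue_rule(mime_type: str) -> Optional[str]:
    """Look up the cue-mapping rule for a text track's file or container type.

    Returns None for formats the diff only refers to as "other format".
    """
    return TEXT_DATA_RULES.get(mime_type.strip().lower())
```

For example, `track_type_from_mime("application/ttml+xml")` yields `"text"`, and `text_cue_rule("video/mp2t")` points at the MPEG-2 TS text track rules. The two steps are kept separate because "video/mp4" and "video/mp2t" have main type "video" yet can also carry encapsulated text track data.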