- From: Nigel Megitt <nigel.megitt@bbc.co.uk>
- Date: Thu, 9 Nov 2023 17:27:24 +0000
- To: TTWG <public-tt@w3.org>
- Message-ID: <8C3D7B46-25E6-4BEE-9487-31EA8FB5FEC1@bbc.co.uk>
Thanks all for attending today’s TTWG meeting. Minutes can be found in HTML format at https://www.w3.org/2023/11/09-tt-minutes.html In text format: [1]W3C [1] https://www.w3.org/ Timed Text Working Group Teleconference 09 November 2023 [2]Previous meeting. [3]Agenda. [4]IRC log. [2] https://www.w3.org/2023/10/12-tt-minutes.html [3] https://github.com/w3c/ttwg/issues/268 [4] https://www.w3.org/2023/11/09-tt-irc Attendees Present Chris, Cyril, Matt, Mike, Nigel, Pierre Regrets Andreas, Atsushi, Gary Chair Nigel Scribe cpn, nigel Contents 1. [5]This meeting 2. [6]IMSC-HRM 3. [7]DAPT 1. [8]Rework audio description original and translation languages w3c/dapt#179 2. [9]Add inline Registry Section w3c/dapt#196 4. [10]AOB: 5G-MAG OSMART workshop 5. [11]Meeting close Meeting minutes This meeting Nigel: Today, we have IMSC-HRM, DAPT and an AOB about 5G-MAG OSMART workshop … Is there any other business, or points for us to make sure we cover within those topics? group: no AOB IMSC-HRM Nigel: I think we set a timeline, where we'd be looking to transition to the next maturity stage about now … which would be PR, so we'd need to meet CR exit criteria … That requires evidence of implementation, either a validating or an authoring implementation (as demonstrated by content) … I've had three organisations contact me that they'd run content through HRM with good results Pierre: One was 25,000 documents Nigel: If your validating implementation passes the tests, Pierre, we can demonstrate everything we need for CR exit? Pierre: I'm confident that we've demonstrated implementation experience Mike: At ATSC, we ran both positive and negative content, and it flagged every document Nigel: That shows the HRM conceptually is useful … and also the effectiveness of the tool used to check it … If we can say those files were generated with a different implementation from the ones that passed, … we can say we have evidence with multiple pots of content, and some of the authoring applications meet the requirements of HRM … But if they're all produced by the same tool, it doesn't demonstrate implementation, but it does demonstrate effectiveness of HRM Mike: We used two encoders, hundreds of documents Nigel: So to demonstrate success, although the ones that failed demonstrate utility of HRM, we only want the ones that passed to demonstrate success Pierre: We want the ones we know pass to pass, and the ones that should fail, fail Mike: If everything passes, you don't really know anything Cyril: Discussed with Pierre last week, related to failures … I ran 3000 Netflix content through, and none failed. More recently testing live content, a naive implementation of 608 conversion … It failed. The reason, if you naively convert 608 to TTML, you can have two characters change every 33 ms, it was failing the HRM Mike: As it should. We had similar experience, the encoder was doing 608 conversion, badly Cyril: But why should it fail? If an implementation of 608 can refresh every 33 ms, why shouldn't a TTML implementation do the same? … It means the HRM as configured today has some known limitations, but is that ok? … I'm not convinced it's not too strict … Our implementation of naive conversion, it plays fine in the Netflix players Mike: I have a different view. Where content was flagged as failiing, it was creating and tearing down content in a single document roughly every 20-30ms … Decoders weren't built to do that. I think it's an unreasonable thing to expect implementations to handle … If we want redesign HRM to handle those things, OK … I'd be concerned about relaxing that, as there are TVs that can't display it Cyril: Not saying we should relax it. It's based on parameters, we chose one set. It doesn't necessarily mean the document can't be rendered, just on those parameters Pierre: TTML isn't 608, different data model, rendering model. 608 is a state machine … The client is a state machine with a cursor, send commands to make the cursor do things. You have multiple channels you can go between ambiguously <Zakim> pal, you wanted to react to a previous speaker Pierre: TTML is a document with semantics, line breaks, regions. The two models are entirely different … The idea of converting 608 to TTML and expecting the same thing, is nonsensical … I think that was recognised early on. Regardless of performance, trying to convert 608 verbatim to a TTML document, there's no expectation you'd get exactly what the 608 was Mike: I agree with that Pierre: The second point is, at the time there was significant limitation in terminals … Nothing close to Android on TVs. The model was created so all reasonable TVs would have a shot at rendering TTML … No expectation that a TV could render 30 or 60 TTML documents a second, which is the rate 608 transmits data … In general, I'm not suprised it would fail on naive translation of 608 content Nigel: Don't disagree with any of the above … A naive view would be that merely appending 2 characters every frame is something that a modern renderer should easily achieve … I think that mischaracterises the problem and the solution space. A couple of extra things need to happen … With TTML you have to do layout again, you can't just append 2 characers at a know grid location, so additional processing to do … Secondly, the 608 and similar models from that era are based on a stateful decoder, where they append to what's already there … Intent of how we deliver TTML is the receiver shouldn't have to maintain state Mike: There's not a fundamental problem converting 608 to IMSC1 and fully meeting the HRM. I checked 3 encoding manufacturers, everything worked great … One particular manufacturer, on one document. They reconstructed the screen 30x per second by adding 2 characters … Caused the TV render the whole thing again. It was horrible, so should die though HRM violation Cyril: Don't disagree it's a bad idea to convert 608 to TTML where you refresh only 2 characters … The one thing I think we should be careful about, if we'd defined a simple TTML profile to match 608, it would work … The other features are there, make it more complex for the implemntation. That's how we should explain it, people currently do 608 conversion … Not good for us to say that that's nonsense Mike: I agree, perhaps the HRM should have a number of dials. An expectation of complexity, rather than the W3C TTML group deciding what complex means for all time … TVs are getting better at handling this stuff … What should be prohibited today may be fine in a few years … The concept of a profile, or adjusting the complexity, depending on the application is a good idea … Need to put a stake in the ground today … Going forward, maybe we need to think of future HRM having dials for no. of characters per section, no. of changes, to serve needs multiple of applications … It's good for today's technology, but make it smarter for the future Pierre: I think naive translation of 608 in TTML is a bad idea, you get TTML documents that aren't portable and you can't match the TTML feature set … If the goal is to allow a straight translation from 608, we're far from that … Assuming that's not the goal, does the current HRM prevent creation of captions that meet the needs of end users? … If the answer is no, we don't need additional controls … Goal is a caption experience required by end users Cyril: I'm not suggesting changing the HRM. We could add a note to say that the TTML and IMSC specs allow more complex features than previous technologies, so the parameters we've chosen don't allow for characters to change in every video frame … Useful to have that in the spec Mike: I have 25 year old MPEG TS that exercises every 608 feature, 2 of 3 encoders get it right Cyril: But they don't refresh the screen every frame Mike: I'm ok with the principle, we need to agree what the HRM checks for at a high level, but need to be careful about 608, it can be done <Zakim> nigel, you wanted to come back to CR exit, when this discussion is done Nigel: Add an issue to add an informative note on potential causes of HRM failure and their mitigations … So pause CR exit until we've agreed wording … We do need to be careful about the language and the reflection on the industry … Aside from that, I think we have enough information to claim we've met CR exit criteria, so we need to write an implementation report … An action for somebody Pierre: I'm happy to write the report, from the wiki page Nigel: Need to be careful about people who described implementations group: I'll take point on checking in with them to see if they're happy to be publicly named. DAPT Nigel: A few things on DAPT … Horizontal review requests. APA has taken an action to do the review, hope to have response from them soon … TAG has gone quiet, and no feedback on security … I'll take an action to follow those up … We sent liaisons, and some haven't responded yet. We'll proceed regardless, for the time being Rework audio description original and translation languages [12]w3c/dapt#179 [12] https://github.com/w3c/dapt/issues/179 github: [13]w3c/dapt#179 [13] https://github.com/w3c/dapt/pull/179 Nigel: This has gone through some iterations. Andreas has been actively reviewing … The point to deal with is flagging in the best way the difference between content described that had an inherent language, and content that didn't … I added a table in the issue with permutations … It's clear if you're transcribing dialog or signing … For content in the video image, it might be text or might not be. If text, can use lang source … If it's a description of the image, or a sound effect, those things don't have a language … The proposal is for any content with a language, use lang source to record it, and the language of the text transcripted in xml lang … And omit lang source if there isn't a language … Also the concept of original language doesn't serve a purpose any more, and could be removed Cyril: On the proposal to be clear on when lang source is present or not, that's fine … Disagree that we don't need the original language … It's to provide context on what the language was. When you're dubbing a show with multiple languages in the original source … or a show translated to a pivot language to create a dubbing, knowing the orignal language can be useful Nigel: That's a change of status of the flag. There are normative statements around original lang to change: make it a list … if someone creates a silent video with no dialog, then it's audio described, what use is orignal lang? Cyril: Doesn't have to be mandatory. The concept of original language should be preserved in the spec … or original languages … If the original languages are French and English, you can translate to English only, French only, or German … If translation to German, you translate everything, but if translating to French you only translate the English parts … It's complex, I need to continue to review. Seems going in the right direction Mike: I try to get people to conform to BCP47. If you constrain to that, it's good. Use the IANA codes Nigel: We do, yes Nigel: To summarise, the proposal seems OK. Original language still has a use, but we can relax some of its requirements: make it non-mandatory and allow multiple values, if that's the best reprsentation of the content SUMMARY: the proposal seems OK. Original Language still has a use, but we can relax some of its requirements: make it non-mandatory and allow multiple values, if that's the best representation of the content Add inline Registry Section [14]w3c/dapt#196 [14] https://github.com/w3c/dapt/issues/196 github: [15]w3c/dapt#196 [15] https://github.com/w3c/dapt/pull/196 Nigel: Thank you Cyril for reviewing. Please take a look … We wanted registry tables in a few places, so this uses new features in W3C Process. They can be updated outside the normal Rec track process … I've based this on the boilerplate text in the TTWG repo … In doing this, I've had to raise a Process issue, but it's probably acceptable now to meet process requirements … Please look, to see if it has any issues SUMMARY: Please review! AOB: 5G-MAG OSMART workshop Chris: I'll paste a link into IRC. [16]Invitation on TTWG member list [16] https://lists.w3.org/Archives/Member/member-tt/2023Nov/0000.html Chris: This came to me from Thomas Stockhammer, who I assume is part of the organising … group for this conference. It's 5G-MAG with DVB and has been extended to other organisations … like DASH-IF, CTA-WAVE, ATSC, 35PP, W3C and others. … Open invitation to share in the workshop anything about reference tools that have been … developed to support specifications. … Intent is to foster collaboration between standards groups. … I thought of two areas of interest: IMSC-HRM and its tooling, and the imscJS implementation, … and some other things like in MediaWG sample code for WebCodecs, … or similar for WebTransport WG. … Thomas latched onto the IMSC and TTML parts more so than the others. … If you would like to present on the tooling that you've developed around IMSC and TTML, and the HRM, … this is an opportunity to present at that workshop. … What I'd like to do is to go back to either confirm or politely decline depending on what you prefer to do. Nigel: It looks to me like it could be useful Pierre: I can follow up, it doesn't have to be a group thing? Nigel: No, I don't think so. Chris: If you wanted to frame this as an official TTWG thing we could go down the route of adding a logo, … but we can maybe treat it less formally. Pierre: I can email Thomas and copy Chris and Nigel and gauge the interest by the response, then … we can figure it out. Nigel: How soon do they need a response? Chris: It's planned for 6th and 7th December so quite soon. Pierre: I'll send something today. Meeting close Nigel: Thank you everyone, and Chris for scribing. Have a good rest of day, let's adjourn now. [adjourns meeting] Minutes manually created (not a transcript), formatted by [17]scribe.perl version 221 (Fri Jul 21 14:01:30 2023 UTC). [17] https://w3c.github.io/scribe2/scribedoc.html
Received on Thursday, 9 November 2023 17:28:01 UTC