Re: [whatwg] VIDEO and pitchAdjustment from David Singer on 2015-09-01 (public-whatwg-archive@w3.org from September 2015)

From: David Singer <singer@apple.com>
Date: Tue, 01 Sep 2015 10:55:52 -0700
To: Yay295 <yay295@gmail.com>
Cc: WHAT Working Group <whatwg@lists.whatwg.org>, Kevin Marks <kevinmarks@gmail.com>, robert@ocallahan.org
Message-id: <7FDB4803-7DFA-4E2E-BDF9-C4EE241089A2@apple.com>

> On Sep 1, 2015, at 10:47 , Yay295 <yay295@gmail.com> wrote:
> 
> On Tue, Sep 1, 2015 at 11:30 AM, David Singer <singer@apple.com> wrote:
> > On Sep 1, 2015, at 4:03 , Robert O'Callahan <robert@ocallahan.org> wrote:
> >> On Tue, Sep 1, 2015 at 8:02 PM, Kevin Marks <kevinmarks@gmail.com> wrote:
> >> QuickTime supports full variable speed playback and has done for well over
> >> a decade. With bidirectionally predicted frames you need a fair few buffers
> >> anyway, so generalising to full variable wait is easier than posters above
> >> claim - you need to work a GOP at a time, but memory buffering isn't the
> >> big issue these days.
> >
> > "GOP”?
> 
> Group of Pictures.  Video-speak for the run between random access points.
> 
> > How about a hard but realistic (IMHO) case: 4K video (4096 x 2160), 25 fps,
> > keyframe every 10s. Storing all those frames takes 250 x 4096 x 2160 x 2
> > bytes = 4.32 GiB. Reading back those frames would kill performance so that
> > all has to stay in VRAM. I respectfully deny that in such a case, memory
> > buffering "isn't a big issue”.
> 
> well, 10s is a pretty long random access interval.
> 
> There's no way to know the distance between keyframes though. The video could technically have only one keyframe and still work as a video.

yes, but that is rare. There are indeed videos that don’t play well backward, or consume lots of memory and/or CPU, but most are fine.

>  
> >> What QuickTime got right was having a ToC approach to video so being able
> >> to seek rapidly was possible without thrashing , whereas the stream
> >> oriented approaches we are stuck with no wean knowing which bit of the file
> >> to read to get the previous GOP is the hard part.
> >
> > I don't understand. Can you explain this in more detail?
> 
> The movie file structure (and hence MP4) has a table-of-contents approach to file structure; each frame has its timestamps, file location, size, and keyframe-nature stored in compact tables in the head of the file. This makes trick modes and so on easier; you’re not reading the actual video to seek for a keyframe, and so on.
> 
> I suppose the browser could generate this data the first time it reads through the video. It would use a lot less memory. Though that sounds like a problem for the browsers to solve, not the standard.

There is no *generation* on the browser side; these tables are part of the file format.

David Singer
Manager, Software Standards, Apple Inc.

Received on Tuesday, 1 September 2015 17:56:19 UTC