[whatwg] SRT research: timestamps

On 06/10/11 01:58 AM, Simon Pieters wrote:

> I don't know how many have negative interval, I'd need to run a new
> script over the 52,000,000 lines to figure out. (If you want me to check
> this, please contact me with details about what you want to count as
> "negative interval".)

I had in mind something like:

cat *.vtt | awk '/-->/ {
  ns = split($1, start, "[:.,]");
  ne = split($3, end, "[:.,]");
  if (ns != ne) print "timestamp field counts differ";
  if (end[1] -start[1] < 0) print "negative interval";
}
BEGIN { negs = 0; misses = 0; }
END { print negs, "negative intervals";
      print misses, "cues skipped because field counts were different";
}'

Which will probably still miscount some garbage lines, but gives a rough
idea.

> leading id e.g.
> 10300:11:53,891 --> 00:11:56,155
> 
> 33

OTOH, sounds like the leading id issue is vanishingly uncommon, so I'm
just curious if there are any other queues which would be rejected that way.

 -r

Received on Friday, 7 October 2011 10:27:29 UTC