Re: Parsing and breaking an HTML document into pieces

> Hi James,
> 
> I kind of remember that you created a script to parse HTML 5  
> specification and break into pieces? Using [html5lib][1] probably?

I think that was me instead

> Do you have an handy link for the script?

http://html5.googlecode.com/svn/trunk/spec-splitter/spec-splitter.py

> Did you choose to break on specific heading levels?

It breaks on <h2>, <h3>, and a few hard-coded extra headings ('video', 
'the-canvas', 'the-command', 'tokenisation', 'tree-construction')

> Best.

-- 
Philip Taylor
pjt47@cam.ac.uk

Received on Tuesday, 4 December 2007 08:22:13 UTC