/ Alex Milowski was heard to say:
| At the end of our e-mail discussion in May I suggested we have a separate
| step for parsing HTML.  I still think this is a good idea.  Anyone else?

So this is the equivalent of "tidy", not the equivalent of "tagsoup"?

I guess I'm ok with this, but I wonder if we'll need a
vocabulary-agnostic cleanup step too. Maybe not.

I guess the next step is to propose a specific step with a description
and the options you think it needs.

