Re: HTML Parsing Step?

/ Alex Milowski <> was heard to say:
| At the end of our e-mail discussion in May I suggested we have a separate
| step for parsing HTML.  I still think this is a good idea.  Anyone else?

So this is the equivalent of "tidy" not the equivalent of "tagsoup",

I guess I'm ok with this, but I wonder if we'll need a
vocabulary-agnostic cleanup step too. Maybe not.

I guess the next step is to propose a specific step with a description
and the options you think it needs.

                                        Be seeing you,

Norman Walsh <> | A wonder is often expressed that the            | greatest criminals look like other men.
                              | The reason is that *they are like other
                              | men in many respects.*-- Hazlitt

Received on Tuesday, 3 July 2007 12:02:32 UTC