Re: XHTML character entity support

On Nov 13, 2009, at 14:00, James Graham wrote:

> John Cowan wrote:
>> James Graham scripsit:
>>> Note that Anne did some work in this area already:
>> That's interesting, although a little crude: some people at Extreme Markup
>> some years back presented a much cleverer algorithm for schemaless tag
>> recovery, given a tree to work with.  Unfortunately, the archives seem to be
>> offline.
> 
> I would be interested in seeing that, if you can dig up some kind of reference.
> 
> Note that a requirement is that the algorithm not need to use lookahead; it must be possible to implement an incremental, error handling, parser.

I had assumed that implementability as a truly streaming SAX parser was also an implicit requirement. (Hence, "given a tree to work with" would be unacceptable.)

-- 
Henri Sivonen
hsivonen@iki.fi
http://hsivonen.iki.fi/

Received on Friday, 13 November 2009 12:03:49 UTC