- From: Lee Passey <lee@novonyx.com>
- Date: Tue, 04 Sep 2001 09:13:30 -0600
- To: html-tidy@w3.org
Lee Passey wrote: > > (2) in TrimSpaces(), no check is made for text nodes which have trimmed > into oblivion. > > I presume newer versions of tidy should include these fixes, so I am > including here the diffs from the 8-2000 version that I used to > accomplish this. > I made a slight mistake here; the 8-2000 version of tidy _does_ delete text nodes which have been trimmed into oblivion _if_ they are not attached to a td or th node. (I presume there are good reasons for maintaining zero length text nodes for those tags, although for the life of me I can't figure out what it would be). In any case, here are the revised diffs for parser.c which fix the mis-trimmed entity. 289d288 < /*! NOTE: is utf-8 encoded as two bytes */ 295,299d293 < if ( (unsigned char)(lexer->lexbuf[last->end - 1]) == 0xc2 < && c == 0xa0) < { < last->end -= 1; < } 304,308d297 < if ( (unsigned char)(lexer->lexbuf[last->end - 1]) == 0xc2 < && c == 0xa0) < { < last->end -= 1; < } 390d378 < { 392,394d379 < if (text->start == text->end) < TrimEmptyElement( lexer, text ); < } 399d383 < { 401d384 < }
Received on Tuesday, 4 September 2001 11:11:22 UTC