- From: Bjoern Hoehrmann <derhoermi@gmx.net>
- Date: Fri, 25 May 2001 00:40:47 +0200
- To: "Randy Waki" <rwaki@flipdog.com>
- Cc: <html-tidy@w3.org>
* Randy Waki wrote:
>> i am trying to converting an html document to xml. In the intial, my
>> document conversion was fine until when i used XT parser to parse the
>> resulting xml ouput . I have the obtained the following error.
>>
>> xml:154: reference to undefined entity "nbsp"
>Using your config file, I get a character with hex value A0, which is
>correct for latin-1 encoding. I don't know why you're getting " ".
>However, if I delete the "quote-nbsp=no" from your config file, I get
>" " instead, which may be what you want (we use Tidy to output
>XHTML this way and it has been working just fine).
Title:
Tidy inserts entity in -xml mode.
Example:
% tidy -xml
<p> 
^Z
<p> </p>
Comment:
No config file. I already reported this bug some months ago.
Possible solutions:
* let pprint.c:PPrintChar() check for XmlOut or
* let -xml set NumEntities = yes
Patch for the first solution:
% diff -u -p -u ..\original\pprint.c pprint.c
--- ..\original\pprint.c Fri Jul 28 17:57:56 2000
+++ pprint.c Fri May 25 00:34:06 2001
@@ -404,7 +404,7 @@ static void PPrintChar(uint c, uint mode
{
AddC('&', linelen++);
- if (NumEntities)
+ if (NumEntities || XmlOut)
{
AddC('#', linelen++);
AddC('1', linelen++);
PS: As usual, against 04 August Release.
--
Björn Höhrmann { mailto:bjoern@hoehrmann.de } http://www.bjoernsworld.de
am Badedeich 7 } Telefon: +49(0)4667/981028 { http://bjoern.hoehrmann.de
25899 Dagebüll { PGP Pub. KeyID: 0xA4357E78 } http://www.learn.to/quote/
Received on Thursday, 24 May 2001 18:39:29 UTC