- From: Matthew Stanfield <matthew@propertyknowledge.com>
- Date: Mon, 04 Mar 2002 21:24:55 +0000
- To: html-tidy@w3.org
Hi,

Please ignore my earlier email. Someone has just told me I need to define all entities in an XML document apart from: &amp; &apos; &gt; &lt; &quot; [&, ', >, <, "].

Regards,

..matthew

Matthew Stanfield wrote:
>
> Hi,
>
> I'm using Tidy (specifically TidyCOM) to tidy 'messy' html into valid 'well
> formed' xml before using XSLT to retrieve specific bits of data from the
> tidied xml.
>
> The web page is 'successfully' tidied and when I use XML Spy to check for
> 'well formedness' it says it is well formed. But when I then apply my XSLT
> transformation to it, XML Spy says the "File is not well formed: Reference
> to undefined entity 'nbsp'" [the entity is actually as you would expect
> "&nbsp;" in the file, it's just the error that says "nbsp" instead of
> "&nbsp;".]
>
> I can get rid of this problem by setting the tidy option to "quote-nbsp: no"
> in my options config file but when I do so I get the same error but with a
> different entity, this time "&copy;".
>
> If I then get rid of the "&copy;" entity manually (there is only one in the
> file) then everything works fine.
>
> Any suggestions as to what I should do would be greatly appreciated.
>
> Thanks and regards,
>
> ..matthew
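For anyone hitting the same error: since XML predefines only those five entities, one workaround is to rewrite every other named HTML entity as a numeric character reference, which needs no declaration. Below is a minimal Python sketch of that idea, not anything Tidy or XML Spy does internally. The function name `to_numeric_entities` is made up for illustration, and the lookup table is Python's standard `html.entities.name2codepoint` (the HTML 4 entity set). As far as I know, Tidy's `numeric-entities: yes` option is meant to give a similar result directly, but check that against the documentation for your Tidy version.

```python
import re
import xml.etree.ElementTree as ET
from html.entities import name2codepoint

# The five entities every XML parser knows without a DTD.
XML_PREDEFINED = {"amp", "lt", "gt", "apos", "quot"}

# Matches named entity references such as &nbsp; or &copy;
_ENTITY_RE = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def to_numeric_entities(text):
    """Replace HTML named entities with numeric character references.

    Hypothetical helper for illustration. Entities predefined in XML
    and names missing from the HTML 4 table are left untouched.
    """
    def replace(match):
        name = match.group(1)
        if name in XML_PREDEFINED or name not in name2codepoint:
            return match.group(0)
        return "&#%d;" % name2codepoint[name]

    return _ENTITY_RE.sub(replace, text)


if __name__ == "__main__":
    tidied = "<p>a&nbsp;b &copy; 2002 &amp; &lt;</p>"
    fixed = to_numeric_entities(tidied)
    print(fixed)
    # Parsing succeeds now because no undeclared entity remains.
    ET.fromstring(fixed)
```

The alternative is to keep the named entities and declare them in an internal DTD subset at the top of the document; the numeric form just avoids touching the DOCTYPE.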
Received on Monday, 4 March 2002 16:25:16 UTC