W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2003

Re: [Tidy-dev] Annoying Symbol At Start Of XML Outputted File.

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Fri, 04 Apr 2003 07:01:06 +0200
To: Matthew Stanfield <mattstan@blueyonder.co.uk>
Cc: tidy-develop@lists.sourceforge.net, html-tidy@w3.org
Message-ID: <3ebc0463.182932613@smtp.bjoern.hoehrmann.de>

* Matthew Stanfield wrote:
>When tidying html and outputting as xml, there is a symbol that is 
>appearing at the start of my XML files, ascii value is 0x98. How do I stop 
>it appearing?
>
>I assume this is the 'unicode Byte Order Mark character' that is mentioned 
>in the Tidy configuration options reference.

No, the byte order mark would be 00 00 FE FF, FF FE 00 00, 00 00 FF FE,
FE FF 00 00, FE FF, FF FE or EF BB BF depending on the encoding, 0x98
never indicates a byte order mark.

>The char is stopping tidy output as xml from being read correctly by .net's 
>C# XPathDocument class. When I manually remove the char all works fine.
>
>I am using Charles Reitzel's COM/ATL dll.

If you are unable to reproduce this using the command line tool or the
pure C interface, please ask Charles privately (or wherever he may
provide support to this) and please don't crosspost between
<html-tidy@w3.org> and <tidy-develop@lists.sourceforge.net>.
Received on Friday, 4 April 2003 00:01:21 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:54 GMT