- From: Tim Chadwick <TChadwick@cstlink.com>
- Date: Thu, 13 Mar 2003 14:42:32 -0500
- To: <html-tidy@w3.org>
- Message-ID: <07D4D31324C87742B85778F6B3EC2344014BE7@ppp-64-115-191-30.broadviewnet.net>
Hi,
I am having an encoding problem
I have a document that is coming in as UTF-16
I am using TidyATL
And this is the config file
doctype: "-//W3C//DTD XHTML 1.1//EN"
indent-spaces: 4
indent: auto
indent-attributes: yes
tidy-mark: no
output-xhtml: yes
output-bom: yes
char-encoding: utf16
this is my input:
<html>
<head>
<META http-equiv="Content-Type" content="text/html; charset=UTF-16">
</head>
<body>
<P>
1970’s
</P>
</body>
</html>
this is the result
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "">
<html>
<head>
<meta http-equiv="Content-Type"
content="text/html; charset=UTF-16" />
<title></title>
</head>
<body>
<p>1970’s</p>
</body>
</html>
the same result holds true if I change the input to UTF-8
and the char-encoding to utf8
there is one warning about a missing title…
the character that is getting encoded incorrectly has Unicode (hex) value 2019
any input is appreciated
thanks
Received on Thursday, 13 March 2003 14:44:39 UTC