Encoding...

Hi,
I am having an encoding problem.
 
I have a document that comes in as UTF-16.
I am using TidyATL, and this is the config file:
 
doctype: "-//W3C//DTD XHTML 1.1//EN"
indent-spaces: 4
indent: auto
indent-attributes: yes
tidy-mark: no
output-xhtml: yes
output-bom: yes
char-encoding: utf16
 
This is my input:
 
<html>
<head>
<META http-equiv="Content-Type" content="text/html; charset=UTF-16">
</head>
<body>
<P>
1970’s 
</P>
</body>
</html>
 
This is the result:
 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "">
<html>
<head>
    <meta http-equiv="Content-Type"
          content="text/html; charset=UTF-16" />
    <title></title>
</head>
<body>
    <p>1970’s</p>
</body>
</html>
 
The same result occurs if I change the input to UTF-8
and set char-encoding to utf8.
 
There is one warning about a missing title…
 
The character that is getting encoded incorrectly is U+2019 (RIGHT SINGLE QUOTATION MARK).
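 
For reference, here is a small Python sketch (not part of TidyATL or Tidy) that prints how U+2019 is represented in the encodings mentioned here, which can help when comparing against a hex dump of the output file. The last step is only an assumed failure mode, where UTF-8 bytes get read back as Windows-1252; nothing in this report confirms that this is what happens.

```python
# U+2019 RIGHT SINGLE QUOTATION MARK, the character reported above.
quote = "\u2019"

# Byte sequences a correctly encoded file should contain for this character.
utf8_bytes = quote.encode("utf-8")          # three bytes
utf16le_bytes = quote.encode("utf-16-le")   # two bytes, little-endian, no BOM
cp1252_bytes = quote.encode("cp1252")       # one byte in Windows-1252

# Assumed failure mode (hypothetical, not confirmed by the report):
# the UTF-8 bytes are decoded as if they were Windows-1252.
misread = utf8_bytes.decode("cp1252")

print("utf-8    :", utf8_bytes.hex())
print("utf-16-le:", utf16le_bytes.hex())
print("cp1252   :", cp1252_bytes.hex())
print("utf-8 bytes misread as cp1252:", repr(misread))
```

If the bytes in the output file match one of these sequences, the file itself is probably fine and the problem is more likely in how it is being displayed.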
 
Any input is appreciated.
Thanks
 
 
 

Received on Thursday, 13 March 2003 14:44:39 UTC