
Encoding...

From: Tim Chadwick <TChadwick@cstlink.com>
Date: Thu, 13 Mar 2003 14:42:32 -0500
Message-ID: <07D4D31324C87742B85778F6B3EC2344014BE7@ppp-64-115-191-30.broadviewnet.net>
To: <html-tidy@w3.org>
Hi,
I am having an encoding problem.
 
I have a document that is coming in as UTF-16.
I am using TidyATL, and this is the config file:
 
doctype: "-//W3C//DTD XHTML 1.1//EN"
indent-spaces: 4
indent: auto
indent-attributes: yes
tidy-mark: no
output-xhtml: yes
output-bom: yes
char-encoding: utf16
 
This is my input:
 
<html>
<head>
<META http-equiv="Content-Type" content="text/html; charset=UTF-16">
</head>
<body>
<P>
1970’s 
</P>
</body>
</html>
 
This is the result:
 
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "">
<html>
<head>
    <meta http-equiv="Content-Type"
          content="text/html; charset=UTF-16" />
    <title></title>
</head>
<body>
    <p>1970’s</p>
</body>
</html>
 
The same result holds true if I change the input to UTF-8
and the char-encoding to utf8.
 
There is one warning, about a missing title.
 
The character that is getting encoded incorrectly is the apostrophe in "1970’s", Unicode value U+2019 (hex).
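
For reference, here is a minimal Python sketch of how U+2019 is represented as bytes in a few encodings. The last step, decoding the UTF-8 bytes as Windows-1252, is only an assumption about one common way this character ends up garbled; nothing here confirms that this is what TidyATL does. The names `encodings_of` and `misread` are made up for illustration.

```python
# U+2019 RIGHT SINGLE QUOTATION MARK, the character from the sample input.
ch = "\u2019"


def encodings_of(text):
    """Return the byte sequences for `text` in a few common encodings."""
    return {
        "utf-8": text.encode("utf-8"),
        "utf-16-le": text.encode("utf-16-le"),
        "utf-16-be": text.encode("utf-16-be"),
        # Windows-1252 has a single-byte slot for this character.
        "cp1252": text.encode("cp1252"),
    }


encoded = encodings_of(ch)
for name, raw in encoded.items():
    print(f"{name:10s} {raw.hex(' ')}")

# Assumed failure mode (hypothetical): UTF-8 bytes read back as Windows-1252.
# One character turns into three unrelated ones.
misread = encoded["utf-8"].decode("cp1252")
print(repr(misread))
```

Comparing the bytes in the output file against these sequences should show whether the character was written in the requested encoding or converted through some other code page along the way.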
 
Any input is appreciated.
Thanks
 
 
 
Received on Thursday, 13 March 2003 14:44:39 GMT
