W3C home > Mailing lists > Public > www-validator@w3.org > January 2002

Re: Macintosh charset blowing up?

From: Martin Duerst <duerst@w3.org>
Date: Sat, 05 Jan 2002 17:11:31 +0900
Message-Id: <4.2.0.58.J.20020105170639.03ad1990@localhost>
To: Nick Kew <nick@webthing.com>, <www-validator@w3.org>, Gerald Oskoboiny <gerald@w3.org>
Cc: <nhtcapri@rrzn-user.uni-hannover.de>
At 00:22 02/01/05 +0000, Nick Kew wrote:

>In news:040120022151192238%andreas.prilop@altavista.net,
>Andreas Prilop reports an apparent problem:
>
> > The W3C validator somehow "swallows" characters 246 and 247
> > in Macintosh encoding:
> > 
> <http://validator.w3.org/check?uri=www.unics.uni-hannover.de/nhtcapri/test.htm>
> > <http://www.unics.uni-hannover.de/nhtcapri/test.htm>
>
>
>On investigation, I find that iconv to macintosh to utf-8 runs
>apparently successfully,

I just did wget and iconv on the validator.w3.org machine.
iconv -f MACINTOSH -t UTF-8 <test.htm produced the null bytes.
It is clearly a bug in iconv.

For iconv --version, I get:

iconv (GNU libc) 2.1.3
Copyright (C) 1999 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
Written by Ulrich Drepper.

Gerald, I suggest we upgrade to the newest version of iconv
and then see whether the problems are gone.

Regards,    Martin.

>but the utf-8 generated appears to contain
>spurious null bytes at the characters Andreas highlights,
>and in any case causes OpenSP to choke.
>
>Can anyone throw further light on this?
>
>
>--
>Nick Kew
>
>Site Valet - the mark of Quality on the Web.
><URL:http://valet.webthing.com/>
Received on Saturday, 5 January 2002 05:58:19 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 25 April 2012 12:14:00 GMT