Re: New FAQ: Removing UTF-8 BOM

From: Jungshik Shin <jshin@i18nl10n.com>
Date: Wed, 5 Nov 2003 23:39:34 +0900 (KST)
To: Martin Duerst <duerst@w3.org>
Cc: public-i18n-geo@w3.org
Message-ID: <Pine.LNX.4.58.0311052336000.12721@jshin.net>

On Wed, 5 Nov 2003, Martin Duerst wrote:

> It can even be typed directly, as:
>
> prompt>  perl -pi~ -0777 -e "s/^\xEF\xBB\xBF//s;" filewithbom.html

  Well, this doesn't work with Perl 5.6 or later, because from 5.6 on
the native representation of characters is UTF-8. Even in earlier
versions of Perl, it has the problem of removing U+FEFF at places
other than the very beginning of the file.
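
  One way to sidestep both concerns is to work on raw bytes and to look
only at offset 0 of the file. The following is a minimal sketch in
Python rather than Perl, not a drop-in replacement for the one-liner;
the function names strip_leading_bom and strip_bom_in_place are
hypothetical, and unlike "perl -pi~" it does not keep a backup copy.

```python
import codecs


def strip_leading_bom(data: bytes) -> bytes:
    """Return data without a UTF-8 BOM (EF BB BF) at offset 0.

    Only the very first three bytes are examined, so any later
    occurrence of U+FEFF in the data is left untouched.
    """
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


def strip_bom_in_place(path: str) -> bool:
    """Rewrite the file at path without its leading BOM.

    Returns True if a BOM was removed, False if the file was left as is.
    Assumption: the file is small enough to read into memory at once.
    """
    with open(path, "rb") as f:  # binary mode: no decoding is involved
        data = f.read()
    stripped = strip_leading_bom(data)
    if len(stripped) == len(data):
        return False
    with open(path, "wb") as f:
        f.write(stripped)
    return True
```

  Calling strip_bom_in_place("filewithbom.html") would then rewrite the
file only if it actually starts with the three BOM bytes.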

  Jungshik
Received on Wednesday, 5 November 2003 09:39:37 GMT
