Migrating legacy multilingual data to utf-8

From: Deborah Cawkwell <deborah.cawkwell@bbc.co.uk>
Date: Wed, 23 Feb 2005 12:37:10 -0000
Message-ID: <418B7E44473AC34488C9E730D09FF3CF05179052@bbcxue204.national.core.bbc.co.uk>
To: <www-international@w3.org>

We have legacy multilingual data stored in a Postgres database.

In our database, text is typically stored in the character encoding
in which it was entered. This typically corresponds to the language of
the text, e.g. Czech: windows-1250; Chinese: gb2312.

We wish to take more advantage of Unicode, and hence we are considering
migrating the data to UTF-8.

What is the best way to do this?

Any advice or experience would be welcome.
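
To make the question concrete, here is a minimal sketch in Python of the
kind of per-row conversion we have in mind. It assumes each row's language
is known and that the language reliably identifies the legacy encoding.
The names LEGACY_ENCODINGS, to_utf8 and migrate are hypothetical, and how
the raw bytes are read from and written back to Postgres is left out.

```python
# Hypothetical mapping from a language tag to the legacy encoding that was
# in use when the text was entered (taken from the examples above).
LEGACY_ENCODINGS = {
    "cs": "windows-1250",  # Czech
    "zh": "gb2312",        # Chinese
}


def to_utf8(raw: bytes, language: str) -> bytes:
    """Decode legacy bytes with the encoding assumed for `language`,
    then re-encode the resulting text as UTF-8."""
    encoding = LEGACY_ENCODINGS[language]
    # Strict decoding raises on invalid byte sequences, so a mislabelled
    # row is reported instead of being silently corrupted.
    text = raw.decode(encoding, errors="strict")
    return text.encode("utf-8")


def migrate(rows):
    """Convert an iterable of (row_id, language, raw_bytes) tuples.

    Returns (converted, failed): converted holds (row_id, utf8_bytes),
    failed holds (row_id, reason) for rows needing manual review.
    """
    converted, failed = [], []
    for row_id, language, raw in rows:
        try:
            converted.append((row_id, to_utf8(raw, language)))
        except (KeyError, UnicodeDecodeError) as exc:
            failed.append((row_id, str(exc)))
    return converted, failed
```

Rows that fail strict decoding, or whose language has no known encoding,
are collected separately so they can be inspected by hand rather than
guessed at.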


Received on Wednesday, 23 February 2005 12:37:13 UTC
