W3C home > Mailing lists > Public > www-international@w3.org > April to June 2003

Re: Looking for UTF-8 *plain text* multilingual sample page and text editor

From: Tex Texin <tex@i18nguy.com>
Date: Tue, 01 Apr 2003 18:53:46 -0500
Message-ID: <3E8A268A.26FE80B7@i18nguy.com>
To: "Kurosaka, Teruhiko" <Teruhiko.Kurosaka@iona.com>
CC: "Www-International (E-mail)" <www-international@w3.org>

Hi,
My Unicode Example page:

http://www.i18nguy.com/unicode/unicode-example-intro.html

has a link to a zip file of a comma separated value file (ie plain text list)
of example data in UTF-8:

http://www.i18nguy.com/unicode/unicode-example-utf8.zip

The intent was so for the file contents to be available to be easily loaded
into databases, spreadsheets, etc so it could be used for experimentation and
testing.

hth

tex



"Kurosaka, Teruhiko" wrote:
> 
> Hello,
> I am looking for a sample plain text file that has a few lines of sample text
> in different languages (at minimum, English, Chinese, Japanese and Korean)
> in UTF-8.  I have found some HTML samples such as below but I could not find
> an equally good one in plain text format:
> http://www.columbia.edu/kermit/utf8.html
> http://crism.maden.org/dunno.html
> 
> I tried to convert one of the above to a plain text by eliminating tags and
> extra text but my editor (Meadow, Mule based Windows specialized Emacs
> editor) complains it cannot be UTF-8, probably because the text contains
> some byte sequence it cannot understand for whatever reason.
> 
> I am also looking for a multilingual text editor that can read in
> plain text file in popular encodings of CJK and UTF-8.  It does not
> need to have any fancy functions.  Read, Write and Cut and Paste
> would be enough.  Hopefully, it should have a builtin font so that
> it can work on English Windows w/o extra font installation.
> If you can recommend any, let me know.
> 
> T. "Kuro" Kurosaka
> Internationalization Architect
> teruhiko.kurosaka@iona.com
> -------------------------------------------------------
> IONA Technologies
> 2350 Mission College Blvd. Suite 650
> Santa Clara, CA 95054
> Tel: (408) 350 9684/9500
> Fax: (408) 350 9501
> -------------------------------------------------------
> Making Software Work Together TM

-- 
-------------------------------------------------------------
Tex Texin   cell: +1 781 789 1898   mailto:Tex@XenCraft.com
Xen Master                          http://www.i18nGuy.com
                         
XenCraft		            http://www.XenCraft.com
Making e-Business Work Around the World
-------------------------------------------------------------
Received on Tuesday, 1 April 2003 18:55:24 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:00 GMT