Starting to implement IDN for libwww and Amaya

From: Martin Duerst <duerst@w3.org>
Date: Sun, 23 Feb 2003 21:03:07 -0500
Message-Id: <4.2.0.58.J.20030223204353.050ea948@localhost>
To: www-lib@w3.org

Dear Libwww community,

The IETF IDN (Internationalized Domain Name) Working Group
(http://www.ietf.org/html.charters/idn-charter.html)
has been at work for a long time and has mostly completed
its work. The relevant RFCs are expected to be released
shortly.

Together with Jose, I have started to check how to implement
IDN in libwww (and therefore in Amaya). Please expect to
see some new code over the next few weeks. Please don't
hesitate to ask questions and give comments.

To give you some additional background, the plan is to
accept IRIs whose domain names are either encoded in UTF-8
or %-escaped (based on UTF-8), and to do the conversion to
Punycode (the very specialized encoding used in IDN) in
HTGetHostByName in HTDNS.c. On my machine, I have the
%-escaping part already working (i.e. http://www.w%33.org
resolves to the same place as http://www.w3.org). Well, I
have to admit that this was the easy part :-).
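
As a rough illustration of the two steps described above (undoing
%-escapes based on UTF-8, then converting to the ASCII form used in
the DNS), here is a minimal sketch in Python. It is not libwww code
and does not show how HTGetHostByName will do it. The function name
host_to_ascii is hypothetical, and Python's built-in "idna" codec is
used only as a stand-in for the Punycode conversion.

```python
from urllib.parse import unquote


def host_to_ascii(host: str) -> str:
    """Hypothetical helper: map a UTF-8 or %-escaped host name to ASCII.

    Illustration only; the real change is planned for HTGetHostByName
    in HTDNS.c and may work differently.
    """
    # Step 1: undo %-escapes, treating the escaped bytes as UTF-8.
    # errors="strict" makes invalid UTF-8 raise instead of being replaced.
    unescaped = unquote(host, encoding="utf-8", errors="strict")

    # Step 2: convert each label to its ASCII-compatible form.
    # Python's built-in "idna" codec applies nameprep and Punycode
    # and leaves labels that are already plain ASCII unchanged.
    return unescaped.encode("idna").decode("ascii")


if __name__ == "__main__":
    for name in ("www.w%33.org", "b%C3%BCcher.example", "bücher.example"):
        print(name, "->", host_to_ascii(name))
```

With this sketch, the %-escaped and the raw UTF-8 spellings of the
same name end up as the same ASCII host name, which is the property
the resolver needs before it asks the DNS.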

Currently, the plan is to either use
http://www.gnu.org/software/libiconv/ as a separate
library or to use code from idnkit-1.0pr2 at
http://www.nic.ad.jp/ja/idn/mdnkit/download/#sources.
The former is probably more straightforward, but the
latter comes with fewer strings attached. If you have
any preference, please tell me.


Regards,   Martin.

#-#-#-#-#-#-#-#
Dr.sc. Martin J. Du"rst

Internationalization Activity Lead
World Wide Web Consortium (W3C)

Visiting Scientist
Massachusetts Institute of Technology
Laboratory for Computer Science

200 Technology Square
Cambridge, MA 02139
USA
Tel:  +1 617 253 5509
Fax:  +1 617 258 5999

mailto:duerst@w3.org
http://www.w3.org/
http://www.w3.org/People/D%c3%bcrst
#-#-#-#-#-#-#-#
Received on Sunday, 23 February 2003 21:16:40 GMT
