Le jeudi 20 déc. 06 19:06 à 13:35, Tania Estébanez a écrit : > > http://hotwired.goo.ne.jp/news/print/20000414303.html > this page is EUC-JP encoded, i would be u, i'll first transcode it in UTF-8 and then make use of tidy for html to xml conversion. what's the sort of langage u are using for programing ? if Ruby, it's easy within ruby to make EUC-JP to UTF-8 (or UTF-16) transcoding. in ruby (as in perl) their are also method to guess encoding from file input... best, YvonReceived on Wednesday, 20 December 2006 15:07:42 GMT
This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 18:21:37 GMT