
Re: a bug? Chinese webpage 'not xhtml"

From: Zhang Weiwu <zhangweiwu@realss.com>
Date: Wed, 11 Feb 2004 16:51:26 +0800
Message-ID: <4029ED0E.8070206@realss.com>
To: Irene Vatton <irene.vatton@inrialpes.fr>
Cc: www-amaya@w3.org

Irene Vatton wrote:

>On Wed, 11 Feb 2004 14:40:03 +0800
>Zhang Weiwu <zhangweiwu@realss.com> wrote:
>>This is my first time using Amaya. It seems Amaya's I18N support is not 
>>very good. (Did I miss something?)
>>To repeat the bug:
>>* Open this url with Amaya 8.3: http://aliweekly.nease.net/040209.html
>>you get the "Not Well-Formed XML document - Reload as HTML or show 
>>parsing errors?"
>>* Click "show", you get something like this screenshot
>>I cannot determine what part is wrong on this screen, there are no clear 
>>But the URL [http://aliweekly.nease.net/040209.html], like most of my other 
>>webpages, is valid.
>>Isn't the validator in Amaya the same as on http://validator.w3.org ?
>Amaya works with unicode characters.
>We have "jisx0201", "jisx0208", "jisx0212", and "gb2312" tables that convert these 
>characters into unicode characters, but we don't have a table for "gb18030" characters.
>The XML parser (expat) tries to read this gb18030 document as a utf-8 document, so it 
>stops at the first non-utf-8 character.
>I agree the parser message is not clear. We'll change the code to report the error before 
>launching the parser.
>By the way, could you point us to a conversion table for "gb18030" characters?
I don't have a gb18030 conversion table, but this article provides a good 
explanation and a method for converting gb18030 to/from Unicode;
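
To make the failure described above concrete, here is a minimal Python sketch of 
what happens when gb18030 bytes are read as UTF-8, and how the same bytes decode 
once the right table is used. It only illustrates the behaviour and is not how 
Amaya or expat is implemented; the sample markup and the helper name 
first_invalid_utf8_offset are made up for this example. It relies on Python's 
built-in "gb18030" codec.

```python
def first_invalid_utf8_offset(data: bytes):
    """Return the byte offset where UTF-8 decoding fails, or None if data is valid UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start
    return None


# Hypothetical page fragment; the real page content is not reproduced here.
sample = "<p>中文</p>"

# Bytes as a gb18030-encoded page would deliver them.
raw = sample.encode("gb18030")

# A strict UTF-8 reader stops at the first byte that is not valid UTF-8.
# The ASCII prefix "<p>" is fine, so the failure is at offset 3.
offset = first_invalid_utf8_offset(raw)

# With the correct conversion, the same bytes map back to the Unicode text.
recovered = raw.decode("gb18030")

print(offset)
print(recovered == sample)
```

A check like this, run before the XML parser starts, is one way to report 
"unsupported or mismatched encoding" instead of a generic well-formedness error.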

>    Irene.
>Irène Vatton                     INRIA Rhône-Alpes
>INRIA                               ZIRST
Oh you are from INRIA. Nice to meet you. Scilab is really a good tool 
from INRIA, and I'm hosting two plug-in projects for Scilab 
(scilabanywhere and vrscilab on sf.net:) Do we have many people here 
from INRIA?
Received on Wednesday, 11 February 2004 03:51:50 UTC
