W3C home > Mailing lists > Public > www-zig@w3.org > March 2002

RE: Z39.50 character encoding

From: LeVan,Ralph <levan@oclc.org>
Date: Fri, 1 Mar 2002 08:49:29 -0500
Message-ID: <E5431CF93E29F9478878F623E5B9CE9802DD9162@OA3-SERVER.oa.oclc.org>
To: www-zig@w3.org
UTF-8 is the default characterset for XML.  It is possible to specify a
different characterset.

Ralph

> -----Original Message-----
> From: Alan Kent [mailto:ajk@mds.rmit.edu.au]
> Sent: Thursday, February 28, 2002 7:35 PM
> To: www-zig@w3.org
> Subject: Re: Z39.50 character encoding
> 
> 
> On Thu, Feb 28, 2002 at 09:13:20AM -0500, Johan Zeeman wrote:
> > DC by itself is not a record syntax; it is a list of data 
> elements.  To be a
> > record syntax, the data elements need to be encoded using 
> some scheme.  The
> > one I know about is XML.  And XML explicitly uses UTF-8.
> > 
> > j.
> 
> Just to clarify, do you mean the XML record syntax in Z39.50 
> explicitly
> uses UTF-8? XML itself certainly *does not* explicitly use UTF-8.
> That is simply what is common. People do use other encodings with
> XML (UTF-16 for example is completely valid and in usage - for
> example when using Chinese or other scripts, UTF-16 encoded files
> are much smaller than the same UTF-8 encoded files).
> 
> I was just curious (without re-reading the XML record syntax) whether
> it was a Z39.50 decree that the XML record syntax mandates 
> UTF-8 encoding.
> 
> Thanks,
> Alan
> 
Received on Friday, 1 March 2002 08:50:05 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 29 October 2009 06:12:22 GMT