W3C home > Mailing lists > Public > www-zig@w3.org > March 2002

Re: character encoding assumptions and approaches

From: Ray Denenberg <rden@loc.gov>
Date: Thu, 07 Mar 2002 11:15:34 -0500
Message-ID: <3C879226.4D44358C@loc.gov>
To: www-zig@w3.org
"LeVan,Ralph" wrote:

> Let's change the question slightly.  Why should the application know what
> kind of data it is returning?  Why should it behave differently for one kind
> of data than another?  Did you know that there is text embedded in JPEG
> files?

Actually no, my format experts here tell me that jpeg represents text as bits,
but they might be mistaken. In any case, certainly we wouldn't expect conversion
to utf-8 in mixed-content or print-format (e.g. pdf, postscript)  files.


>   I
> think you assume too much knowlege about MARC records and should treat them
> like any other record format.

If we define a utf-8 option bit, what do you think it should apply to then?

--Ray
Received on Thursday, 7 March 2002 11:14:29 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 29 October 2009 06:12:22 GMT