- From: Robert Sanderson <azaroth@liverpool.ac.uk>
- Date: Thu, 7 Mar 2002 16:44:31 +0000 (GMT)
- To: Ray Denenberg <rden@loc.gov>
- cc: <www-zig@w3.org>
> > of data than another? Did you know that there is text embedded in JPEG
> > files?
>
> Actually no, my format experts here tell me that jpeg represents text as bits,
> but they might be mistaken. In any case, certainly we wouldn't expect conversion
> to utf-8 in mixed-content or print-format (e.g. pdf, postscript) files.
To be pedantic, there is plain text embedded in some jpeg files.
For example:
'Created with The GIMP'
These same comments can be found in GIF, but also the GIF version
information is in plain text. For example:
GIF89a
This is text and it is in graphic files, along with a Whole Lot of other
information.
In theory these comments could be searched. If it were mandated that the
utf-8 option bit means that text in all records, including external ones,
should be returned in utf-8, then there would be serious problems.
This is the point Ralph was trying to make, I think.
Rob
--
,'/:. Rob Sanderson (azaroth@liverpool.ac.uk)
,'-/::::. http://www.o-r-g.org/~azaroth/
,'--/::(@)::. Special Collections and Archives, extension 3142
,'---/::::::::::. Twin Cathedrals: telnet: liverpool.o-r-g.org 7777
____/:::::::::::::. WWW: http://liverpool.o-r-g.org:8000/
I L L U M I N A T I
Received on Thursday, 7 March 2002 11:48:31 UTC