- From: (wrong string) çois Yergeau <yergeau@alis.com>
- Date: Thu, 24 Aug 2000 12:57:30 -0400
- To: <www-amaya@w3.org>
Dave J Woolley wrote:
> [DJW:] I said served as text/html, which implies
> the use of an HTTP server and text/html content-type.
I don't see that in your message. You wrote things such as "compatible with
the HTML default character set" and "For HTML compatibility, I would
therefore say that Amaya should be defaulting ISO 8859/1" and "Amaya is an
[X]HTML tool, it should default to 8859/1". I felt it was important to
debunk this purported HTML default charset, which doesn't exist.
> [DJW:] That's not the same thing. No encoding means
> UTF-8 or UTF-16.
If you mean no encoding *declaration* in XML, then yes (unless there is
external information, such as an HTTP header, that says otherwise).
> Amaya should store documents as
> ISO 8859/1 unless told otherwise, as that is what most
> users will expect;
Well, that's what it does, no? Apart from the fact that this is not a very
good idea (a better default would be something based the platform's
locale/charset, still better a user preference), the problem is that Amaya
currently saves XHTML with inconsistent encoding declarations, unless you
set a language. If I save a new document, I now see:
<?xml version="1.0"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>No title</title>
<meta name="GENERATOR" content="amaya V3.2.1" />
<meta http-equiv="Content-Type" content="text/html" />
</head>
The XML declaration, lacking an encoding decl., implies that the encoding is
UTF-8 while the document is actually saved in 8859-1. The <meta> comes too
late in the <head> (it should come before <title>) and anyway is totally
useless; if I can parse that far, I already know it's text/html! As Vincent
indicated, only if I set a language do I get correct declarations.
> What apparently is happening is that it is defaulting to
> UTF-8, although I suspect it only works correctly if the
> display also uses UTF-8.
I don't see that, how do get this behaviour? What I see is the doc stored
in 8859-1 but the XML declaration lying about it. If I set a language att
on <html>, the doc is still stored in 8859-1 but the declarations now
correctly say so.
--
François Yergeau
Received on Thursday, 24 August 2000 13:01:15 UTC