- From: Jirka Kosek <jirka@kosek.cz>
- Date: Thu, 08 Dec 2005 23:13:17 +0100
- To: "Jukka K. Korpela" <jkorpela@cs.tut.fi>
- Cc: www-validator@w3.org
- Message-ID: <4398AFFD.7010007@kosek.cz>
Jukka K. Korpela wrote:

> Apparently the validator uses UTF-8 as the implied default.
>
> The choice is impractical

I can't recall the RFC number off the top of my head, but the HTTP protocol assumes ISO-8859-1 as the default for all text/* media types. So it is not "impractical", it is clearly a bug. That's why text/xml was superseded by application/xml, where no such default is assumed.

If there is no charset parameter, ISO-8859-1 should be assumed from the HTTP point of view, but an XML document without an XML declaration is assumed to be UTF-8 or UTF-16. However, HTTP takes precedence, so you are decoding the XML content with the wrong encoding assumption. Not good.

It sounds silly to serve XML with a content type other than text/*, but legacy is legacy :-(

Jirka

--
------------------------------------------------------------------
Jirka Kosek e-mail: jirka@kosek.cz http://www.kosek.cz
------------------------------------------------------------------
Professional training and consulting in XML technologies.
Take a look at our newly launched website http://DocBook.cz
Detailed overview of training courses http://xmlguru.cz/skoleni/
------------------------------------------------------------------
Upcoming training dates:
** XSLT 13.-16.3.2006 ** XML schemas 24.-26.4.2006 **
** DocBook 15.-17.5.2006 ** XSL-FO 12.-13.6.2006 **
------------------------------------------------------------------
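
To make the precedence described in the message concrete, here is a minimal sketch in Python. The function name `pick_encoding` is hypothetical and this is not the validator's actual code; it only illustrates the order argued for above: an explicit charset parameter wins, then the HTTP default for text/* types, and only then XML's own rules. It is simplified in that UTF-16 is recognised only when a byte order mark is present.

```python
import re


def pick_encoding(content_type, body):
    """Guess the encoding a receiver would assume for an HTTP response.

    content_type: value of the Content-Type header (str)
    body: the first bytes of the response entity (bytes)
    """
    media_type, _, params = content_type.partition(";")
    media_type = media_type.strip().lower()

    # 1. An explicit charset parameter always wins.
    match = re.search(r'charset\s*=\s*"?([^";\s]+)"?', params, re.IGNORECASE)
    if match:
        return match.group(1).lower()

    # 2. No charset: HTTP's default for text/* applies, as argued in the
    #    message, regardless of what the XML document itself says.
    if media_type.startswith("text/"):
        return "iso-8859-1"

    # 3. No HTTP-level default (e.g. application/xml): use XML's own rules.
    if body.startswith(b"\xef\xbb\xbf"):
        return "utf-8"
    if body.startswith((b"\xff\xfe", b"\xfe\xff")):
        return "utf-16"
    decl = re.match(
        rb'<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)', body
    )
    if decl:
        return decl.group(1).decode("ascii").lower()

    # XML without a declaration or byte order mark is treated as UTF-8.
    return "utf-8"
```

With this sketch, the same undeclared document comes out as ISO-8859-1 when served as text/xml and as UTF-8 when served as application/xml, which is the mismatch the message complains about.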
Received on Thursday, 8 December 2005 22:13:34 UTC