Re: For review: Migrating to Unicode

Frank Ellermann scripsit:

> | Outgoing XML should always be encoded in UTF-8
> 
> Maybe add "or its proper subset US-ASCII", because that avoids any
> potential problems with a text/xml Content-Type.  Maybe say this:
> "but note that US-ASCII is the default for Content-Type text/xml".

Not only the default, but a default that can be overridden only by the
charset parameter of the Content-Type header itself, not by the encoding
declaration in the XML prologue.  For that reason, text/xml should be
avoided.
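
To make the precedence concrete, here is a minimal Python sketch of how
a receiver might pick the encoding.  The function name
effective_xml_encoding is hypothetical, and the application/xml branch
(charset parameter, then encoding declaration, then UTF-8) is my
simplified reading of the media-type rules; it ignores BOM detection
entirely.

```python
import re
from email.message import Message

# Matches the encoding pseudo-attribute of an XML declaration at the
# very start of the document (simplified; no BOM handling).
_XML_DECL_ENCODING = re.compile(
    rb'^<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._\-]*)["\']'
)


def effective_xml_encoding(content_type, body):
    """Return the encoding a receiver would assume (hypothetical helper)."""
    msg = Message()
    msg["Content-Type"] = content_type
    media_type = msg.get_content_type()
    charset = msg.get_content_charset()  # lower-cased, or None if absent

    # An explicit charset parameter wins for both media types.
    if charset is not None:
        return charset

    # text/xml: the default is US-ASCII and the prologue is not consulted.
    if media_type == "text/xml":
        return "us-ascii"

    # Assumed behaviour for application/xml: fall back to the prologue.
    match = _XML_DECL_ENCODING.match(body)
    if match:
        return match.group(1).decode("ascii").lower()
    return "utf-8"


doc = b'<?xml version="1.0" encoding="ISO-8859-1"?><a/>'
print(effective_xml_encoding("text/xml", doc))
print(effective_xml_encoding("text/xml; charset=utf-8", doc))
print(effective_xml_encoding("application/xml", doc))
```

Under text/xml without a charset parameter, the ISO-8859-1 declaration
in the prologue has no effect, which is the trap described above.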

> | Windows-1252, an extension of ISO-8859-1
> 
> Is "extension" strictly correct?  Or is it only a "variation"?

Extension is strictly correct.  ISO 8859-1 itself does not assign
meaning to the bytes 0x80-0x9F (the overall framework may assign them
meaning as control characters), but Windows-1252 assigns printable
characters to most of them.  In the 0xA0-0xFF range, the two are
identical.
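
This can be checked with a short Python sketch.  It relies on Python's
own codec tables: "latin-1" there maps 0x80-0x9F to the C1 control
characters (the framework reading), and "cp1252" leaves a handful of
those bytes undefined.  The helper name decode_or_none is hypothetical.

```python
def decode_or_none(byte_value, encoding):
    """Decode one byte, or return None if the codec leaves it undefined."""
    try:
        return bytes([byte_value]).decode(encoding)
    except UnicodeDecodeError:
        return None


# Collect every byte value on which the two codecs disagree.
differing = [
    b for b in range(256)
    if decode_or_none(b, "cp1252") != decode_or_none(b, "latin-1")
]

print(hex(differing[0]), hex(differing[-1]), len(differing))

# Example: 0x80 is the euro sign in Windows-1252, a control in latin-1.
print(repr(b"\x80".decode("cp1252")), repr(b"\x80".decode("latin-1")))
```

The only disagreements fall in 0x80-0x9F, so every byte that ISO 8859-1
defines as a graphic character means the same thing in Windows-1252.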

-- 
Mos Eisley spaceport.  You will never           John Cowan
see a more wretched hive of scum and            cowan@ccil.org
villainy -- unless you watch the                http://www.ccil.org/~cowan
Jerry Springer Show.   --georgettesworld.com

Received on Friday, 21 March 2008 02:14:51 UTC