Re: Comments on Part 1: Encoding declaration

> From: Murata Makoto <murata@apsdc.ksp.fujixerox.co.jp>

> (4) Proposed changes
> 
> If an external text entity does not begin with a Byte Order
> Mark or an encoding declaration, XML processors may assume 
> that this entity is in the same encoding as the entity 
> that references to it.
> 
> If a document entity does not begin with a Byte Order
> Mark or an encoding declaration, XML processors may assume 
> that this entity is in the UTF-8 encoding.
> 
> XML processors may use other information to detect the
> actual encoding method, but are not required to do so.

I agree. think the first method is a preferable default, and it can get added to the list in the annex.
The important thing is that some definite strategy is in place:

* default is UTF-8 for top-level, or inherited if an external entity reference (!not a link).
* but BOM overrides this: Unicode
* but encoding PI overrides this
* else...any kind of autodetect or user preference list or locale-setting

Received on Tuesday, 3 June 1997 04:41:50 UTC