[Prev][Next][Index][Thread]

Re: Reads like ASCII (was Re: character sets ...)



>> Autodetection also fails very quickly when faces with a number of
>> multibyte encodings. 
>
>Yes, but I am not proposing that: the only thing autodetected is
>whether the encoding is definitely 16-bit or not, for which you only
>need to look at the first two octets (i.e. if one is zero valued).

OK. What do you do if you have multiple pure 16 bit encodings? How
about UCS-4? What happens if the 8 bit encoding is *not* ASCII compatible?



References: