Invalid encoding test cases

Hi,

  I am searching for a collection of sample documents in various
encodings containing invalid octet sequences to test decoding routines.
For example, the sequence 0x42, 0x6A, 0xF6, 0x72, 0x6E ("Björn" in
ISO-8859-1) would be invalid in US-ASCII (8th bit set on 0xf6), UTF-8
(0xf6 is invalid start byte), UTF-16 (uneven number of octets), etc.

Is there such a test suite available?

TIA.

Received on Tuesday, 22 April 2003 21:35:18 UTC