Hi, I am searching for a collection of sample documents in various encodings containing invalid octet sequences to test decoding routines. For example, the sequence 0x42, 0x6A, 0xF6, 0x72, 0x6E ("Björn" in ISO-8859-1) would be invalid in US-ASCII (8th bit set on 0xf6), UTF-8 (0xf6 is invalid start byte), UTF-16 (uneven number of octets), etc. Is there such a test suite available? TIA.Received on Tuesday, 22 April 2003 21:35:18 UTC
This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 22:40:47 UTC