- From: Bjoern Hoehrmann <derhoermi@gmx.net>
- Date: Wed, 23 Apr 2003 03:35:00 +0200
- To: www-international@w3.org
Hi, I am searching for a collection of sample documents in various encodings containing invalid octet sequences to test decoding routines. For example, the sequence 0x42, 0x6A, 0xF6, 0x72, 0x6E ("Björn" in ISO-8859-1) would be invalid in US-ASCII (8th bit set on 0xf6), UTF-8 (0xf6 is invalid start byte), UTF-16 (uneven number of octets), etc. Is there such a test suite available? TIA.
Received on Tuesday, 22 April 2003 21:35:18 UTC