Invalid encoding test cases

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Wed, 23 Apr 2003 03:35:00 +0200
To: www-international@w3.org
Message-ID: <3edce702.298679588@smtp.bjoern.hoehrmann.de>

Hi,

  I am searching for a collection of sample documents in various
encodings containing invalid octet sequences to test decoding routines.
For example, the sequence 0x42, 0x6A, 0xF6, 0x72, 0x6E ("Björn" in
ISO-8859-1) would be invalid in US-ASCII (8th bit set on 0xF6), UTF-8
(0xF6 is an invalid start byte), UTF-16 (odd number of octets), etc.
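
To make the example concrete, here is a minimal sketch of the kind of check
such a test suite would automate, assuming Python's built-in codecs stand in
for the decoding routines under test. The names SAMPLE, try_decode and
results are made up for this illustration, and "utf-16-le" is picked only to
fix the byte order.

```python
# The octet sequence from the message: "Björn" encoded as ISO-8859-1.
SAMPLE = bytes([0x42, 0x6A, 0xF6, 0x72, 0x6E])


def try_decode(data, encoding):
    """Strictly decode data; return (text, None) or (None, reason)."""
    try:
        return data.decode(encoding, errors="strict"), None
    except UnicodeDecodeError as exc:
        # exc.reason is the codec's own short description of the failure.
        return None, exc.reason


# Decode the same octets under each encoding named in the message.
results = {
    enc: try_decode(SAMPLE, enc)
    for enc in ("iso-8859-1", "us-ascii", "utf-8", "utf-16-le")
}

for enc, (text, reason) in results.items():
    status = repr(text) if reason is None else "rejected: " + reason
    print(enc, "->", status)
```

With a strict decoder, only the ISO-8859-1 run should return text; the other
three should be rejected. The exact wording of the rejection differs between
implementations, so a test suite would more usefully record whether and where
a sequence fails than the message itself.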

Is there such a test suite available?

TIA.
Received on Tuesday, 22 April 2003 21:35:18 GMT
