- From: Richard Tobin <richard@cogsci.ed.ac.uk>
- Date: Fri, 7 Nov 2003 14:59:56 GMT
- To: Glenn Marcy <gmarcy@us.ibm.com>, public-xml-testsuite@w3.org
- Cc: Richard Tobin <richard@cogsci.ed.ac.uk>
> (1) ibm-valid-P02-ibm02v01.xml > The UTF-8 code for LSEP (2028) in this file seems to be wrong. > I believe it should be e2 80 a8, the file has e0 9f ac which is > a non-shortest UTF-8 sequence for something else. > > [GM] Agree, a typo, the byte sequence corresponds to the character #x7EC > and should be changed to e2 80 a8, but its still a valid document. It's not the shortest sequence for 7EC, so it's a UTF-8 error and therefore not well-formed. -- Richard
Received on Friday, 7 November 2003 10:00:02 UTC