- From: Henry S. Thompson <ht@cogsci.ed.ac.uk>
- Date: 02 Jul 2002 09:48:58 +0100
- To: Mark Feblowitz <mfeblowitz@frictionless.com>
- Cc: "Xmlschema-Dev (E-mail)" <xmlschema-dev@w3.org>
Mark Feblowitz <mfeblowitz@frictionless.com> writes:
> Probably a FAQ, but not one I've encountered.
>
> Can all available validating parsers validate an XML instance represented in
> UTF-16 against an XML Schema represented in UTF-8?
Unless they are non-conformant at the XML level, they should.
Infosets contain characters, not encodings, so the encoding should be
invisible at the schema validation level.
> Would anything special need to be done to achieve the validation, or would
> it suffice for each of them to have their encodings appropriately indicated?
That should do it.
> The intent here is to use a given Schema, represented in UTF-8, to validate
> an instance document that uses the same element and attribute labels, yet
> the element and/or attribute content requires UTF-16, e.g., strings
> containing accented French.
(That doesn't require UTF-16 -- UTF-8 works fine for any Unicode character.)
ht
--
Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
W3C Fellow 1999--2002, part-time member of W3C Team
2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
URL: http://www.ltg.ed.ac.uk/~ht/
[mail really from me _always_ has this .sig -- mail without it is forged spam]
Received on Tuesday, 2 July 2002 04:49:01 UTC