RDF schema encoding/labelling bug

I think the RDF schema for RDF isn't well-formed!
http://www.w3.org/2000/01/rdf-schema

I'm messing around with XML Schemas for RDF, and
I get this error from XSV:

"Low-level XML well-formedness and/or validity processing output

Error: Input error: Illegal UTF-8 byte 2 <0x74> at file offset 1925
 in unnamed entity at line 52 char 29 of
http://www.w3.org/2000/01/rdf-schema"

but xmlwf (i.e. expat) says rdf-schema is well-formed.

It could be an HTTP related problem... the server
labels it as
	Content-Type: text/plain

which, under the HTTP default for text/* types,
implies charset=iso-8859-1 ... if XSV is (correctly)
inferring that encoding, I might understand why
there are problems; but then:
why is it complaining about UTF-8 bytes? Does
it convert iso-8859-1 to UTF-8 and then complain
about the results or something?

Let me try charlint... hmm... it agrees:
"Line 52: Non-UTF-8 (synchronization problem)."

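Looks like the file contains Latin-1 (iso-8859-1)
bytes but isn't labelled as such: there's no encoding
declaration in it, so the tools are evidently reading
it as UTF-8. This is bad.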
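
To show the kind of check that XSV and charlint seem to be
doing here, a minimal Python sketch that finds the first byte
sequence that is not valid UTF-8. The function name and the
sample bytes are made up for illustration (they are not taken
from rdf-schema), and the offset it reports is that of the
lead byte, which may differ from the offset a given tool prints.

```python
def first_invalid_utf8(data: bytes):
    """Return (offset, line, reason) for the first invalid UTF-8
    sequence in data, or None if all of data decodes cleanly."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the 0-based offset where decoding failed;
        # count the newlines before it to get a 1-based line number.
        line = data.count(b"\n", 0, err.start) + 1
        return err.start, line, err.reason
    return None


# Hypothetical sample: "été" stored as Latin-1, so 0xE9 is followed
# by "t" (0x74), which is not a valid UTF-8 continuation byte.
sample = b"<a>\n<b>\xe9t\xe9</b>\n</a>"
print(first_invalid_utf8(sample))
```

A lone Latin-1 byte such as 0xE9 looks like the start of a
multi-byte UTF-8 sequence, so the complaint is really about
the byte that follows it.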

I suggest sticking with US-ASCII and using numeric
character references (&#nnn;) for anything that
can't be encoded that way.
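
A rough Python sketch of that conversion, assuming the bytes
really are iso-8859-1 and that the non-ASCII characters only
occur in character data or attribute values (character
references are not recognized in element names or comments).
The function name is hypothetical.

```python
def latin1_to_ascii_ncr(data: bytes) -> bytes:
    """Re-encode Latin-1 bytes as US-ASCII, writing every non-ASCII
    character as a decimal numeric character reference."""
    text = data.decode("iso-8859-1")
    # The "xmlcharrefreplace" error handler emits &#nnn; for each
    # character that the ascii codec cannot represent.
    return text.encode("ascii", errors="xmlcharrefreplace")


print(latin1_to_ascii_ncr(b"<b>\xe9t\xe9</b>"))
```

The result is pure US-ASCII, so it reads the same whether a
processor takes it as US-ASCII, iso-8859-1, or UTF-8.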

For my purposes this evening, I worked around
the problem by copying the schema to
http://www.w3.org/2000/07/rdf-schema-lat1.xml
and adding an encoding declaration.
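
For anyone who wants to script the same workaround, a minimal
Python sketch. It is an assumption about how one might do it,
not a record of what I actually ran; the function name and the
regular expression are illustrative only.

```python
import re

# Matches an XML declaration: the version value is group 1 and the
# rest of the declaration (encoding, standalone, if any) is group 2.
XML_DECL = re.compile(
    rb"<\?xml\s+version\s*=\s*(\"[^\"]*\"|'[^']*')([^?]*)\?>"
)


def add_encoding_declaration(data: bytes, encoding: str = "iso-8859-1") -> bytes:
    """Label an XML document with an encoding declaration,
    leaving it untouched if it already has one."""
    label = b' encoding="%s"' % encoding.encode("ascii")
    decl = XML_DECL.match(data)
    if decl is None:
        # No XML declaration at all: prepend a complete one.
        return b'<?xml version="1.0"' + label + b"?>\n" + data
    if b"encoding" in decl.group(2):
        return data  # already labelled
    # The encoding goes right after the version, before any standalone.
    cut = decl.end(1)
    return data[:cut] + label + data[cut:]


print(add_encoding_declaration(b"<b>\xe9t\xe9</b>"))
```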

-- 
Dan Connolly, W3C http://www.w3.org/People/Connolly/

Received on Wednesday, 26 July 2000 01:18:48 UTC