W3C home > Mailing lists > Public > public-cwm-bugs@w3.org > August 2008

Re: cwm has problem parsing a foaf file that passes the validator tests

From: Yosi Scharf <syosi@MIT.EDU>
Date: Sun, 17 Aug 2008 01:15:55 -0400
Message-ID: <48A7B40B.2090205@mit.edu>
To: Story Henry <henry.story@bblfish.net>
CC: public-cwm-bugs@w3.org

That file is written in ISO-8859-15 encoding. Cwm assumes everything in
the world is utf-8. Normally this is not a problem, because the python
libraries it uses to get things from the web convert things correctly.
At least I think that's the way it works. In this case, the file is
being sent with

Content-Type: text/plain; charset=UTF-8

so no conversion is being done in any case. Cwm is perfectly within its
rights to give an error that this is not a valid utf-8.

Yosi Scharf


Story Henry wrote:
> seen on #swig:
>
> mhh David Peterson's foaf is accepted by the w3c's validator but not
> by cwm
> [07:34AM] bblfish_: cwm --rdf  http://data.boab.info/david/foaf.rdf
> [07:34AM] bblfish_: does not work
> [07:35AM] bblfish_: but it's ok for
> http://www.w3.org/RDF/Validator/ARPServlet?URI=http%3A%2F%2Fdata.boab.info%2Fdavid%2Ffoaf.rdf&PARSE=Parse+URI%3A+&TRIPLES_AND_GRAPH=PRINT_TRIPLES&FORMAT=PNG_EMBED
>
>
> Henry
>
> Home page: http://bblfish.net/
>
Received on Sunday, 17 August 2008 05:17:09 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Sunday, 17 August 2008 05:17:09 GMT