W3C home > Mailing lists > Public > public-rdf-comments@w3.org > March 2012

test-29: special characters in Turtle IRIs

From: Henry Story <henry.story@bblfish.net>
Date: Fri, 2 Mar 2012 08:19:00 +0100
Message-Id: <115F77A1-BCA3-42E5-BB97-D604AA64C4DE@bblfish.net>
To: public-rdf-comments@w3.org
pretty much the only positive test that fails for me at present consistently across Jena, Sesame and my 
implementation is Test-29.ttl [1] which contains the following statement

<http://example.org/node> <http://example.org/prop> <scheme:\u0001\u0002\u0003\u0004\u0005\u0006\u0007\u0008\t\n\u000B\u000C\r\u000E\u000F\u0010\u0011\u0012\u0013\u0014\u0015\u0016\u0017\u0018\u0019\u001A\u001B\u001C\u001D\u001E\u001F !"#$%&'()*+,-./0123456789:/<=\u003E?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\\]^_`abcdefghijklmnopqrstuvwxyz{|}~\u007F> .

This is causing the apache abdera IRI [2] library to barf . It looks like they put a lot of energy into this library, and so that's made me wonder where the error lies. This can be reproduced like this on the scala console

scala> import org.apache.abdera.i18n.iri._
scala> val iriStr = "scheme:\u0001\u0002\u0003\u0004\u0005\u0006\u0007\u0008\t\n\u000B\u000C\r\u000E\u000F\u0010\u0011\u0012\u0013\u0014\u0015\u0016\u0017\u0018\u0019"
[line elided for control chars: possibly a scala signature]
scala> val iriStr2 = "\u001A\u001B\u001C\u001D\u001E\u001F !\"#$%&'()*+,-./0123456789:/<=\u003E?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\\]^_`abcdefghijklmnopqrstuvwxyz{|}~\u007F"
[line elided for control chars: possibly a scala signature]
scala> val iri = iriStr + iriStr2 
scala> val i = new IRI(iri)
org.apache.abdera.i18n.iri.IRISyntaxException: org.apache.abdera.i18n.text.InvalidCharacterException: Invalid Character 0x1(?)
	at org.apache.abdera.i18n.iri.IRI.parse(IRI.java:577)
	at org.apache.abdera.i18n.iri.IRI.<init>(IRI.java:64)
       ...


I looked at http://tools.ietf.org/html/rfc3987 to see what the spec said there, but I don't think those characters are
allowed. Can I remove this from the examples? What should I replace it with that would test the spec? Should we move this 
one to a bad-test?

	Henry


[1]  http://www.w3.org/TR/turtle/tests/test-29.ttl
[2]  http://grepcode.com/file/repo1.maven.org/maven2/org.apache.abdera/abdera-i18n/1.1.2/org/apache/abdera/i18n/iri/IRI.java
     http://abdera.apache.org/



Social Web Architect
http://bblfish.net/
Received on Friday, 2 March 2012 07:19:31 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:59:30 UTC