W3C home > Mailing lists > Public > semantic-web@w3.org > November 2009

Re: XML problems with percent encoding

From: Richard Cyganiak <richard@cyganiak.de>
Date: Tue, 17 Nov 2009 12:56:28 +0100
Cc: Damian Steer <pldms@mac.com>, semantic-web@w3.org, Michael Martin <martin@informatik.uni-leipzig.de>, Christopher Jona Sahnwaldt <christopher@sahnwaldt.de>, Matthias Weidl <matthias.weidl@googlemail.com>, Anja Jentzsch <anja@anjeve.de>, Robert Isele <robertisele@gmail.com>
Message-Id: <AA3CE06E-019F-4E46-B3EA-78C592C94EEA@cyganiak.de>
To: Sebastian Hellmann <hellmann@informatik.uni-leipzig.de>
Just throwing in one comment: I *believe* (but am not 100% sure) that  
XML element names may contain characters outside of the US-ASCII  
range, such as “ä”. So, <dbpedia:längengrad> might actually be a valid  
XML element. I have no idea if this also applies to RDF/XML. The  
relationships between URIs, IRIs, RDF, XML, UTF-8 etc are incredibly  
complex... But it might be worth trying if a URI such as <http://de.dbpedia.org/property/längengrad 
 > is actually somehow allowed *in the RDF data model*, and how it  
would be serialized in the different RDF surface syntaxes.

Best,
Richard


On 17 Nov 2009, at 12:38, Sebastian Hellmann wrote:

> Hello,
> thank you very much, this will actually work. I tested it with OWL  
> API and Raptor
> and they both can read and reserialize it (Jena can also).
> OWL API even puts it in the header:
> <!ENTITY E "http://ko.dbpedia.org/property/ 
> &#37;EA&#37;B4&#37;91&#37;EC&#37;9E&#37;90" >
> ...
> <E:_>4.5</E:_>
>
> So, I guess, this is a valid workaround, although it results in a  
> strange rule for URI designers:
> "If your encoded URIs contain any % followed by two numbers, add an  
> underscore"
> But well, maybe focus will be on Turtle some day.
>
> @Matthias, please use the underscore
> (maybe we should even use it for all Korean (and other languages  
> than English) properties as it should be uniform)
>
> Many thanks,
> Sebastian
>
>
>
> Damian Steer schrieb:
>> Sebastian Hellmann wrote:
>>
>>> Hello,
>>>
>>>
>>>> You could look at adding an _
>>>> (http://ko.dbpedia.org/property/_%EA%B4%91%EC%9E%90) for the  
>>>> problem
>>>> cases, or add a fragment
>>>> (http://ko.dbpedia.org/property/%EA%B4%91%EC%9E%90#it)? The  
>>>> latter has
>>>> its own issues, so I'd try the former.
>>>>
>>>> Damian
>>>>
>>>>
>>> Does not seem to work, see below (third last line).     <dbp:_%EA 
>>> %B4%91%EC%9E%90>4.5</dbp:_%EA%B4%91%EC%9E%90>
>>>  </Lager>
>>> </rdf:RDF>
>>>
>>
>> Sorry Sebastian, fingers didn't type what I was thinking:
>>
>> http://ko.dbpedia.org/property/%EA%B4%91%EC%9E%90_
>>
>> <j.0:_
>> xmlns:j.0="http://ko.dbpedia.org/property/%EA%B4%91%EC%9E%90">4.5</ 
>> j.0:_>
>>
>> Damian
>>
>>
>>
>
>
> -- 
> Dipl. Inf. Sebastian Hellmann
> Department of Computer Science, University of Leipzig
> Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
> Research Group: http://aksw.org
>
>
Received on Tuesday, 17 November 2009 11:57:10 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 19:48:03 UTC