W3C home > Mailing lists > Public > public-cwm-bugs@w3.org > August 2004

UTF, non ASCII chars in subjects and predicates

From: meyer <meyer@ashleyassociates.co.jp>
Date: Wed, 04 Aug 2004 17:19:04 +0900
Message-ID: <41109BF8.3010609@ashleyassociates.co.jp>
To: public-cwm-bugs@w3.org

I am using the latest cvs checkout of cwm
predicates that end in non-ASCII(?) characters get merged with the 
namespace leaving an empty tag.
according to the xml specification it is possible to have utf8 chars in 
tags.
I also looked into the documentation about non ascii support , but i 
thought i mention it, since it 'almost' works :)
Probably im just doing something wrong ...

greetings
Joerg

P.S: i think n3 is the best thing since the invention of the 'brötchen'. :)
I'm just to stupid to grab it fully. The last time i felt like that, was 
with my very first computer: a commodore 500. ;)

my n3 file is:
"""
@prefix : <#>.
:僕 :名前 "ヨーク".
"""

Everything works fine, but the predicate get merged with the namespace 
and leaves an empty tag:
"""
<rdf:RDF xmlns="japanese.n3#名前"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
     <rdf:Description rdf:about="#僕">
         <>ヨーク</>
     </rdf:Description>
</rdf:RDF>
"""

if i add a ascii predicate it gets even funnier:
"""
@prefix : <#>.
:僕 :名前 "ヨーク".
:my :name "Joerg".
"""

"""
<rdf:RDF xmlns="japanese.n3#"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
     <rdf:Description rdf:about="#my">
         <name>Joerg</name>
     </rdf:Description>
     <rdf:Description rdf:about="#僕">
         < xmlns="japanese.n3#名前">ヨーク</>
     </rdf:Description>
</rdf:RDF>
"""

If i end the predicate with an ascii character however, it starts working:
"""
@prefix : <#>.
:僕 :名前a "ヨーク".
:my :name "Joerg".
"""

"""
<rdf:RDF xmlns="japanese.n3#"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
     <rdf:Description rdf:about="#my">
         <name>Joerg</name>
     </rdf:Description>
     <rdf:Description rdf:about="#僕">
         <a xmlns="japanese.n3#名前">ヨーク</a>
     </rdf:Description>
</rdf:RDF>
"""

and translated back:
"""
#Processed by Id: cwm.py,v 1.158 2004/07/23 18:54:17 syosi Exp
#    using base file:out.rdf

#  Notation3 generation by
#       notation3.py,v 1.160 2004/07/22 18:04:44 syosi Exp

#   Base was: file:out.rdf
      @prefix : <japanese.n3#> .
      @prefix out: <#> .

     out:my     :name "Joerg" .

     out:僕     :名前a "\u30E8\u30FC\u30AF" .

#ENDS
"""
Received on Wednesday, 4 August 2004 04:19:09 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 19:52:00 UTC