- From: Denis Gaertner <denis_gaertner@gmx.net>
- Date: Wed, 13 Sep 2006 13:30:18 +0200
- To: public-sparql-dev@w3.org
Hi,
I got another question.
It's about character escaping in the regex function. There seem to be
two ways for this.
\\x00 for one character which is ASCII and \u0000 for a unicode
codepoint. This goes fine. I tried \\x{..} as well but doesn't seem to
work in my environment. My problem is that I get escaped characters
which have a longer hexcode in chunks of two, i.e. u + Umlaut U+00FC /
C3BC as "\C3\BC". If you have only characters like that in a foreign
script you get a whole line like this and it is a problem on how to know
which is which. So I was wondering if it is somehow possible to simply
transfer this to \\xc3\\xbc.. in a regular expression without having to
use unicode codepoints.
Thanks again
Denis
Received on Wednesday, 13 September 2006 11:30:33 UTC