W3C home > Mailing lists > Public > public-rdf-dawg-comments@w3.org > October 2006

make regex optional in SPARQL

From: Henry Story <Henry.Story@Sun.COM>
Date: Mon, 23 Oct 2006 13:25:52 +0200
To: public-rdf-dawg-comments@w3.org
Message-id: <A67DC5D6-7FE9-4CB3-8801-7ACD280415A9@Sun.Com>

Hi,

I am a great SPARQL enthusiast [1], but have become concerned about  
the regex functionality in the specification. Having worked at  
AltaVista I have developed a feeling for volume and complexity. When  
one is dealing with 100 million searches a day or more every cpu  
instruction counts.

My feeling is that regex is much too powerful for any large database  
to allow it to be a mandatory part of the spec. (I may very well be  
wrong, if so please point me to a study that shows this not to be the  
case). Any such database will just not be able to implement such a  
query language. Regex therfore must be optional.

If possible it would be good to allow in addition for a simpler form  
of string query language that has been proven to work with large  
databases. This would treat words as atom units, allow their  
inclusion or exclusion from a string. Things like this

    sparql -cat "Danny Ayers"

which would find Danny's posts that don't involve cats. AltaVista did  
allow for simple suffix wildcards such as searches on cat* .

I am just posting this because regex raised an alarm flag in my head.  
Perhaps this has been dealt with before.

Henry

[1] http://blogs.sun.com/bblfish/entry/sparqling_roller among other  
posts

Home page: http://bblfish.net/
Sun Blog: http://blogs.sun.com/bblfish/
Foaf name: http://bblfish.net/people/henry/card#me
Received on Monday, 23 October 2006 11:25:35 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 8 January 2008 14:14:50 GMT