Re: Big data applications for general users based on RDF - where are they?

I find UML to be a useful tool for visualizing OWL ontologies, especially when I encounter a new one that looks interesting. GoodRelations has a decent example of what this approach would look like:

http://www.heppnetz.de/ontologies/goodrelations/goodrelations-UML.png

UML has its limits (e.g. visualizing rdfs:subPropertyOf), but having the ontological terms bound to specific UML graphical constructs helps keep most things straight.

Jeff

On Jun 22, 2013, at 4:47 PM, "Todd DeLuca" <todddeluca@gmail.com> wrote:

Hi Dominic,

I'll bite.  I am a newcomer to RDF, LOD, and the Semantic Web.  Serializing data to RDF has been easy for me, since it's not any different from writing CSV or XML files.
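
A minimal sketch of what that kind of serialization can look like, writing N-Triples lines by hand with only the standard library. The `example.org` URIs, the column names, and the sample rows are hypothetical, chosen for illustration only; which predicates to use in a real dataset is a separate question.

```python
def nt_literal(value):
    """Escape a Python string as an N-Triples plain literal."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def row_to_ntriples(subject_uri, row, predicates):
    """Yield one N-Triples line per non-empty column in the row."""
    for column, value in row.items():
        if value:
            yield f"<{subject_uri}> <{predicates[column]}> {nt_literal(value)} ."


# Hypothetical base URI and predicate choices, for illustration only.
BASE = "http://example.org/gene/"
PREDICATES = {
    "label": "http://www.w3.org/2000/01/rdf-schema#label",
    "organism": "http://example.org/vocab/organism",
}

rows = [
    {"id": "g1", "label": 'BRCA1 "breast cancer 1"', "organism": "Homo sapiens"},
    {"id": "g2", "label": "Brca1", "organism": ""},
]

lines = []
for row in rows:
    # Everything except the id column becomes a literal-valued triple.
    fields = {k: v for k, v in row.items() if k != "id"}
    lines.extend(row_to_ntriples(BASE + row["id"], fields, PREDICATES))

print("\n".join(lines))
```

The mechanics are about as involved as writing a CSV row; the only RDF-specific detail here is escaping the literal.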

On the other hand, knowing what predicates and classes to use has been quite difficult.  The world of ontologies is large, overlapping, disorganized and poorly documented.  I need to know what predicates are being used for my field (orthology), so I can use the same ones.  Sadly, there are some problems with this:

- the vast majority of trivial examples I see use FOAF, but I'm making a bioinformatics app.
- there are multiple orthology predicates defined in ontologies such as the Homology Ontology and the Sequence Ontology.
- as far as I recall, the few existing implementations of orthology databases made up their own predicates, which misses the whole point of shared URIs for entities and properties, IMO.
- most documentation and cultural knowledge seems to live in people's heads on mailing lists.  I've never subscribed to so many email lists or contacted so many strangers out of the blue as for this project.

On the technical side, another enormous challenge has been the lack of a great open-source database.  So far I've used Stardog, Sesame, Virtuoso (open source), and OWLIM-Lite.  They have a wide range of quirks and differences between them in terms of loading speed, inferencing capabilities, scalability, ease of installation and configuration, query performance, APIs, edge cases of standards (like how to treat an empty named graph), etc.

Were I to do this project as a relational database, I would `apt-get install postgresql`, create some tables, make up my column names, and be done with it, then get on with writing queries and a friendly web UI for my non-technical users, who do not know how to use grep or sed. :-)  I hardly exaggerate when I say it is the difference between one week to implement an RDBMS solution versus two months for the RDF/Semantic Web/LOD solution.

OBVIOUSLY the semantic web is the future of data integration, but currently the cultural and technical costs of implementing a project must be too high for most people.

Cheers,
Todd

On Sat, Jun 22, 2013 at 3:21 PM, Dominic Oldman <doint@oldman.me.uk> wrote:

I think it well worth copying Jeff's initial response. I would be interested in responses to it.

"It's pretty easy to write an XSL stylesheet to convert "records" into RDF/XML, and then write a little M/R job to run the XSL against a big bulk of records to boil it down.

The intellectual challenge is the semantic mapping of idiomatic data into RDF vocabulary terms.

Jeff"
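
To make the distinction in Jeff's note concrete, here is a rough sketch in Python rather than XSL. The mechanical part is a short loop; the mapping table is where the judgment sits. The record fields (`ttl`, `auth`, `yr`, `shelf`) and the `example.org` base URI are hypothetical. The Dublin Core term URIs exist, but whether they suit a particular dataset is exactly the open question.

```python
# Mapping from local record fields to shared vocabulary terms.
# Field names are hypothetical; choosing the right-hand side is the hard part.
DCTERMS = "http://purl.org/dc/terms/"
FIELD_TO_PREDICATE = {
    "ttl": DCTERMS + "title",
    "auth": DCTERMS + "creator",
    "yr": DCTERMS + "date",
}


def map_record(record, base="http://example.org/record/"):
    """Return (subject, predicate, value) triples and any unmapped fields."""
    subject = base + record["id"]
    triples, unmapped = [], []
    for field, value in record.items():
        if field == "id":
            continue
        predicate = FIELD_TO_PREDICATE.get(field)
        if predicate is None:
            # No agreed vocabulary term yet: report it instead of inventing one.
            unmapped.append(field)
        else:
            triples.append((subject, predicate, value))
    return triples, unmapped


records = [
    {"id": "r1", "ttl": "On the Origin of Species", "auth": "Darwin", "shelf": "B12"},
]

# The "map" step of a map/reduce job would apply map_record to every record.
for triples, unmapped in map(map_record, records):
    print(triples, unmapped)
```

Fields that end up in `unmapped` are the ones that still need a semantic decision.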

Dominic

________________________________
From: Hugh Glaser <hg@ecs.soton.ac.uk>
To: Young,Jeff (OR) <jyoung@oclc.org>
Cc: doint@oldman.me.uk; public-lod@w3.org
Subject: Re: Big data applications for general users based on RDF - where are they?
Sent: Sat, Jun 22, 2013 6:04:57 PM

Ah, now yer rocking!
But you didn't mention sed (and vi) :-)

On 22 Jun 2013, at 18:57, "Young,Jeff (OR)" <jyoung@oclc.org>
wrote:

> Hugh,
>
> Sorry, you're right. I overlooked the "non-technical users" phrase in Dominic's message.
>
> Let me spin it a little differently, then. If you're a techie, you can use these tools to create N-Triple data-dumps that non-techies can download and use with Unix-style commands like grep and sort and wc.
>
> Jeff
>
>> -----Original Message-----
>> From: Hugh Glaser [mailto:hg@ecs.soton.ac.uk]
>> Sent: Saturday, June 22, 2013 1:53 PM
>> To: Young,Jeff (OR)
>> Cc: doint@oldman.me.uk; public-lod@w3.org
>> Subject: Re: Big data applications for general users based on RDF -
>> where are they?
>>
>> Hi Jeff,
>> I assume you aren't suggesting that such tools are suitable for "non-
>> technical users", as Dominic asked.
>> So you must be saying something else?
>> That it is pretty easy, but people don't do it?
>> Hugh
>>
>> On 22 Jun 2013, at 17:27, "Young,Jeff (OR)" <jyoung@oclc.org>
>> wrote:
>>
>>> It's pretty easy to write an XSL stylesheet to convert "records" into
>> RDF/XML, and then write a little M/R job to run the XSL against a big
>> bulk of records to boil it down.
>>>
>>> The intellectual challenge is the semantic mapping of idiomatic data
>> into RDF vocabulary terms.
>>>
>>> Jeff
>>>
>>> From: Dominic Oldman [mailto:doint@oldman.me.uk]
>>> Sent: Saturday, June 22, 2013 12:16 PM
>>> To: public-lod@w3.org
>>> Subject: Big data applications for general users based on RDF - where
>> are they?
>>>
>>>
>>> Why are there so few useful linked data applications for general non
>> technical users that provide functions that people need to support and
>> enhance their work and which operate over large amounts of data owned
>> by different organisations with a high degree of semantic
>> interoperability and robustness?
>>>
>>> Dominic
>>>
>>>
>>
>
>

--
Todd DeLuca
Scientific Programmer
Wall Lab, CBMI, Harvard Medical School
http://todddeluca.com
http://wall.hms.harvard.edu/

Received on Saturday, 22 June 2013 22:38:47 UTC