Re: LOD publishing question

I wonder who would be willing to pay for a service like Sindice.

Juan Sequeda
+1-575-SEQ-UEDA
www.juansequeda.com


On Fri, Jan 31, 2014 at 9:24 AM, Giovanni Tummarello <g.tummarello@gmail.com
> wrote:

> Thanks Hugh,
>
> crawling the web accurately is a billion dollar thing nowadays (not my
> words) and all the big guys accurately crawl all the metadata now (though
> dont give any public api).
>
> I still think a more "focused" version of sindice e.g. just on demand etc
> might be useful and have impact but resources are necessarely limited
>
> Announcements with respect to the rest are coming next week :)
> have a good weekend and thanks for the thanks, appreciated.
>
> Gio
>
>
> On Thu, Jan 30, 2014 at 8:13 PM, Hugh Glaser <hugh@glasers.org> wrote:
>
>> Hi Giovanni,
>> Thank you for the update.
>> I am sorry to hear that Sindice is going into a frozen state, and that
>> circumstances are making that happen, but of course pleased that you are
>> able to keep it going at all.
>> I send you and your team my personal thanks for the service you have
>> provided over the last 5 or so years, and wish you all well.
>> Very best
>> Hugh.
>>
>>
>> On 28 Jan 2014, at 14:19, Giovanni Tummarello <g.tummarello@gmail.com>
>> wrote:
>>
>> > With respect to Sindice
>> >
>> > for a number of reasons, the people who originally created it, the
>> former Data Intensive Infrastructure group, are either not working in the
>> original institution hosting it, National University of Ireland Galway,
>> institute formerly known as DERI or have been assigned to other tasks.
>> >
>> > Sindice has been operating for 5+ years, updating its index, (though we
>> were never perfect) and we believe supported a lot of works on the field,
>>  but its now time to move on.  In the meanwhile the project will continue
>> answer queries but without updating its index.
>> >
>> > Apologies for the inconvenience of course, we'll be posting on this
>> soon and update the homepage to reflect the change.
>> >
>> > Giovanni
>> >
>> >
>> >
>> > On Tue, Jan 28, 2014 at 11:27 AM, Hugh Glaser <hugh@glasers.org> wrote:
>> > Good question.
>> > I'll report what I found, rather than advising.
>> >
>> > So I went there when you published that email, looking for stuff to put
>> in my sameas.org site.
>> > I tried exploring, and when I went to Browse I only found a few things,
>> so wasn't encouraged :-)
>> > (And, as an aside, Advanced Search didn't seem to do anything, and the
>> search links at the bottom were not links.)
>> > So I decided that it wasn't really mature enough to make it worth the
>> effort (yet?), even though there should be massive scope for linkage
>> eventually.
>> >
>> > But the real problem was that I couldn't find any Linked Data, or even
>> an RDF store.
>> > The URIs you use are not very Cool URIs, and I tried to see if there
>> was RDF at the end of them by doing Content Negotiation, but there wasn't.
>> > I am thinking of things like
>> http://tundra.csd.sc.edu/rol/view-person.php?id=291
>> >
>> > So I went away :-)
>> >
>> > For people like me, you could put something about how to see the RDF in
>> an About page (or if it is there, make it easier to find). You only get one
>> chance to snare people on the web, after all.
>> > Of course as Alfredo says, for spidering search engines, and it would
>> have helped me too, you need robots.txt (which I couldn't find either),
>> sitemap, sitemap.xml, voiD description.
>> >
>> > Good luck!
>> > Hugh
>> >
>> > On 28 Jan 2014, at 04:12, WILDER, COLIN <WILDERCF@mailbox.sc.edu>
>> wrote:
>> >
>> > > Another question to you very helpful people-
>> > >
>> > > <and apologies again for semi cross-posting>
>> > >
>> > > Our LOD working group is having trouble publishing our data (see
>> email below) in RDF form. Our programmer, a master's student, who is
>> working under the supervision of myself and a computer science professor,
>> has mapped sample data into RDF, has the triplestore on a D2RQ server
>> (software) on our server and has set up a SPARQL end-point on the latter.
>> But he has been unsuccessful so far getting 3 candidate semantic web search
>> engines (Falcons, Swoogle and Sindice) to  be able to find our data when he
>> puts a test query in to them. He has tried communicating with the people
>> who run these, but to little avail. Any suggestions about sources of
>> information, pointers, best practices for this actual process of publishing
>> LOD? Or, if you know of problems with any of those three search engines and
>> would suggest a different candidate, that would be great too.
>> > >
>> > > Thanks again,
>> > >
>> > > Colin Wilder
>> > >
>> > >
>> > > From: WILDER, COLIN [mailto:WILDERCF@mailbox.sc.edu]
>> > > Sent: Thursday, January 16, 2014 11:51 AM
>> > > To: 'public-lod@w3.org'
>> > > Subject: LOD for historical humanities information about people and
>> texts
>> > >
>> > > To the many people who have kindly responded to my recent email:
>> > >
>> > > Thanks for your suggestions and clarifying questions. To explain a
>> bit better, we have a data curation platform called RL, which is a large,
>> complex web-based MySQL database designed for users to be able to simply
>> input, store and share data about social and textual networks with each
>> other, or to share it globally in RL's data commons. The data involved are
>> individual data items, such as info about one person's name, age, a book
>> title, a specific social relationship, etc. The entity types (in the
>> ordinary-language sense of actors and objects, not in the database tabular
>> sense) can be seen athttp://tundra.csd.sc.edu/rol/browse.php. The data
>> commons in RL is basically a subset of user data that users have elected
>> (irrevocably) to share with all other users of the system. NB there is a
>> lot of dummy data in the data commons right now because of testing.
>> > >
>> > > We are designing an expansion of RL's functionality so as to publish
>> data from the data commons as LOD, so I am doing some preliminary work to
>> assess feasibility and fit by matching up our entity types with
>> RDFvocabularies. Here is what I have so far. First are the entity(ies) and
>> relationships, followed by the appropriate vocabularies:
>> > >
>> > > 1.       Persons, social relations: FOAF, BIO. The "Catalogus
>> Professorum Lipsiensis" or CPL(
>> http://svn.aksw.org/papers/2010/ISWC_CP/public.pdf) looks enormously
>> useful for connecting academics (people), their relations and their books.
>>  But, I cannot seem to get any info page or specification page to load,
>> making me worry that it's dead.
>> > > 2.       Membership in organizations: ORG
>> > > 3.       Enrollment in an academic course (e.g. a lecture course):
>> ??? maybe use a RDF container or RDF collection type of resource to list
>> all students enrolled in a certain course?
>> > > 4.       Travel: ??? We are trying to encode trips, in which one or
>> more people leave one place at one time and arrive at another place at
>> another time. This thus links people, places and times.
>> > > 5.       Texts - i.e. old editions of books and manuscripts: Dublin
>> Core, Bibframe. Use FRBR to distinguish sub- and pre-edition levels of
>> manuscripts, works and ideas.
>> > > 6.       Relationship among texts, including intertexts and
>> citations: Bibliographic ontology (Bibo)
>> > > 7.       Collections of texts in historical library catalogs, e.g.
>> from centuries ago: the DC Collection AP. Maybe also the Bibliographic
>> Reference Ontology (BiRO)?
>> > >
>> > > My understanding is that the Linked Open Vocabulary cloud (LOV) is a
>> useful tool for finding relevant ontologies. The Vocabulary of Interlinked
>> Datasets (VoID) seems more like underlying infrastructure - the tool to
>> translate and link data items in a dataset written in one vocabulary to
>> data items in a set written in another.
>> > >
>> > > Any further help or clarifications are much appreciated. Thanks again-
>> > >
>> > > Colin
>> > >
>> > >
>> > > ----------------
>> > > Dr. Colin F. Wilder
>> > > Associate Director
>> > > Center for Digital Humanities (website; projects page)
>> > > Thomas Cooper Library, University of South Carolina
>> > > 1322 Greene St., Columbia, SC 29208
>> > > Phones: office (803) 777-2810 & mobile (603) 831-3998
>> > > Emails: wildercf@mailbox.sc.edu & colinwilder@gmail.com
>> > > open office hours (use week view in upper right)
>> > > frango ut patefaciam
>> >
>> > --
>> > Hugh Glaser
>> >    20 Portchester Rise
>> >    Eastleigh
>> >    SO50 4QS
>> > Mobile: +44 75 9533 4155, Home: +44 23 8061 5652
>> >
>> >
>> >
>> >
>>
>> --
>> Hugh Glaser
>>    20 Portchester Rise
>>    Eastleigh
>>    SO50 4QS
>> Mobile: +44 75 9533 4155, Home: +44 23 8061 5652
>>
>>
>>
>

Received on Tuesday, 4 February 2014 07:24:33 UTC