- From: Juan Sequeda <juanfederico@gmail.com>
- Date: Tue, 4 Feb 2014 01:23:42 -0600
- To: Giovanni Tummarello <g.tummarello@gmail.com>
- Cc: Hugh Glaser <hugh@glasers.org>, "public-lod@w3.org" <public-lod@w3.org>
- Message-ID: <CAMVTWDyhFUfboXS2Ajzryh+3472d62ne9ac=im=vYs+PgaJn8g@mail.gmail.com>
I wonder who would be willing to pay for a service like Sindice. Juan Sequeda +1-575-SEQ-UEDA www.juansequeda.com On Fri, Jan 31, 2014 at 9:24 AM, Giovanni Tummarello <g.tummarello@gmail.com > wrote: > Thanks Hugh, > > crawling the web accurately is a billion dollar thing nowadays (not my > words) and all the big guys accurately crawl all the metadata now (though > dont give any public api). > > I still think a more "focused" version of sindice e.g. just on demand etc > might be useful and have impact but resources are necessarely limited > > Announcements with respect to the rest are coming next week :) > have a good weekend and thanks for the thanks, appreciated. > > Gio > > > On Thu, Jan 30, 2014 at 8:13 PM, Hugh Glaser <hugh@glasers.org> wrote: > >> Hi Giovanni, >> Thank you for the update. >> I am sorry to hear that Sindice is going into a frozen state, and that >> circumstances are making that happen, but of course pleased that you are >> able to keep it going at all. >> I send you and your team my personal thanks for the service you have >> provided over the last 5 or so years, and wish you all well. >> Very best >> Hugh. >> >> >> On 28 Jan 2014, at 14:19, Giovanni Tummarello <g.tummarello@gmail.com> >> wrote: >> >> > With respect to Sindice >> > >> > for a number of reasons, the people who originally created it, the >> former Data Intensive Infrastructure group, are either not working in the >> original institution hosting it, National University of Ireland Galway, >> institute formerly known as DERI or have been assigned to other tasks. >> > >> > Sindice has been operating for 5+ years, updating its index, (though we >> were never perfect) and we believe supported a lot of works on the field, >> but its now time to move on. In the meanwhile the project will continue >> answer queries but without updating its index. >> > >> > Apologies for the inconvenience of course, we'll be posting on this >> soon and update the homepage to reflect the change. >> > >> > Giovanni >> > >> > >> > >> > On Tue, Jan 28, 2014 at 11:27 AM, Hugh Glaser <hugh@glasers.org> wrote: >> > Good question. >> > I'll report what I found, rather than advising. >> > >> > So I went there when you published that email, looking for stuff to put >> in my sameas.org site. >> > I tried exploring, and when I went to Browse I only found a few things, >> so wasn't encouraged :-) >> > (And, as an aside, Advanced Search didn't seem to do anything, and the >> search links at the bottom were not links.) >> > So I decided that it wasn't really mature enough to make it worth the >> effort (yet?), even though there should be massive scope for linkage >> eventually. >> > >> > But the real problem was that I couldn't find any Linked Data, or even >> an RDF store. >> > The URIs you use are not very Cool URIs, and I tried to see if there >> was RDF at the end of them by doing Content Negotiation, but there wasn't. >> > I am thinking of things like >> http://tundra.csd.sc.edu/rol/view-person.php?id=291 >> > >> > So I went away :-) >> > >> > For people like me, you could put something about how to see the RDF in >> an About page (or if it is there, make it easier to find). You only get one >> chance to snare people on the web, after all. >> > Of course as Alfredo says, for spidering search engines, and it would >> have helped me too, you need robots.txt (which I couldn't find either), >> sitemap, sitemap.xml, voiD description. >> > >> > Good luck! >> > Hugh >> > >> > On 28 Jan 2014, at 04:12, WILDER, COLIN <WILDERCF@mailbox.sc.edu> >> wrote: >> > >> > > Another question to you very helpful people- >> > > >> > > <and apologies again for semi cross-posting> >> > > >> > > Our LOD working group is having trouble publishing our data (see >> email below) in RDF form. Our programmer, a master's student, who is >> working under the supervision of myself and a computer science professor, >> has mapped sample data into RDF, has the triplestore on a D2RQ server >> (software) on our server and has set up a SPARQL end-point on the latter. >> But he has been unsuccessful so far getting 3 candidate semantic web search >> engines (Falcons, Swoogle and Sindice) to be able to find our data when he >> puts a test query in to them. He has tried communicating with the people >> who run these, but to little avail. Any suggestions about sources of >> information, pointers, best practices for this actual process of publishing >> LOD? Or, if you know of problems with any of those three search engines and >> would suggest a different candidate, that would be great too. >> > > >> > > Thanks again, >> > > >> > > Colin Wilder >> > > >> > > >> > > From: WILDER, COLIN [mailto:WILDERCF@mailbox.sc.edu] >> > > Sent: Thursday, January 16, 2014 11:51 AM >> > > To: 'public-lod@w3.org' >> > > Subject: LOD for historical humanities information about people and >> texts >> > > >> > > To the many people who have kindly responded to my recent email: >> > > >> > > Thanks for your suggestions and clarifying questions. To explain a >> bit better, we have a data curation platform called RL, which is a large, >> complex web-based MySQL database designed for users to be able to simply >> input, store and share data about social and textual networks with each >> other, or to share it globally in RL's data commons. The data involved are >> individual data items, such as info about one person's name, age, a book >> title, a specific social relationship, etc. The entity types (in the >> ordinary-language sense of actors and objects, not in the database tabular >> sense) can be seen athttp://tundra.csd.sc.edu/rol/browse.php. The data >> commons in RL is basically a subset of user data that users have elected >> (irrevocably) to share with all other users of the system. NB there is a >> lot of dummy data in the data commons right now because of testing. >> > > >> > > We are designing an expansion of RL's functionality so as to publish >> data from the data commons as LOD, so I am doing some preliminary work to >> assess feasibility and fit by matching up our entity types with >> RDFvocabularies. Here is what I have so far. First are the entity(ies) and >> relationships, followed by the appropriate vocabularies: >> > > >> > > 1. Persons, social relations: FOAF, BIO. The "Catalogus >> Professorum Lipsiensis" or CPL( >> http://svn.aksw.org/papers/2010/ISWC_CP/public.pdf) looks enormously >> useful for connecting academics (people), their relations and their books. >> But, I cannot seem to get any info page or specification page to load, >> making me worry that it's dead. >> > > 2. Membership in organizations: ORG >> > > 3. Enrollment in an academic course (e.g. a lecture course): >> ??? maybe use a RDF container or RDF collection type of resource to list >> all students enrolled in a certain course? >> > > 4. Travel: ??? We are trying to encode trips, in which one or >> more people leave one place at one time and arrive at another place at >> another time. This thus links people, places and times. >> > > 5. Texts - i.e. old editions of books and manuscripts: Dublin >> Core, Bibframe. Use FRBR to distinguish sub- and pre-edition levels of >> manuscripts, works and ideas. >> > > 6. Relationship among texts, including intertexts and >> citations: Bibliographic ontology (Bibo) >> > > 7. Collections of texts in historical library catalogs, e.g. >> from centuries ago: the DC Collection AP. Maybe also the Bibliographic >> Reference Ontology (BiRO)? >> > > >> > > My understanding is that the Linked Open Vocabulary cloud (LOV) is a >> useful tool for finding relevant ontologies. The Vocabulary of Interlinked >> Datasets (VoID) seems more like underlying infrastructure - the tool to >> translate and link data items in a dataset written in one vocabulary to >> data items in a set written in another. >> > > >> > > Any further help or clarifications are much appreciated. Thanks again- >> > > >> > > Colin >> > > >> > > >> > > ---------------- >> > > Dr. Colin F. Wilder >> > > Associate Director >> > > Center for Digital Humanities (website; projects page) >> > > Thomas Cooper Library, University of South Carolina >> > > 1322 Greene St., Columbia, SC 29208 >> > > Phones: office (803) 777-2810 & mobile (603) 831-3998 >> > > Emails: wildercf@mailbox.sc.edu & colinwilder@gmail.com >> > > open office hours (use week view in upper right) >> > > frango ut patefaciam >> > >> > -- >> > Hugh Glaser >> > 20 Portchester Rise >> > Eastleigh >> > SO50 4QS >> > Mobile: +44 75 9533 4155, Home: +44 23 8061 5652 >> > >> > >> > >> > >> >> -- >> Hugh Glaser >> 20 Portchester Rise >> Eastleigh >> SO50 4QS >> Mobile: +44 75 9533 4155, Home: +44 23 8061 5652 >> >> >> >
Received on Tuesday, 4 February 2014 07:24:33 UTC