Re: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC co-mentor]

I understand the need for commitments.
Let’s take it offline?

-Nicolas.





From: Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de<mailto:kontokostas@informatik.uni-leipzig.de>>
Date: Thursday, March 6, 2014 at 11:32 PM
To: Nicolas Torzec <torzecn@yahoo-inc.com<mailto:torzecn@yahoo-inc.com>>
Cc: "semantic-web@w3.org<mailto:semantic-web@w3.org>" <semantic-web@w3.org<mailto:semantic-web@w3.org>>, Linked Data community <public-lod@w3.org<mailto:public-lod@w3.org>>, DBpedia Discussions <dbpedia-discussion@lists.sourceforge.net<mailto:dbpedia-discussion@lists.sourceforge.net>>, DBpediaDevelopers <dbpedia-developers@lists.sourceforge.net<mailto:dbpedia-developers@lists.sourceforge.net>>, "dbp-spotlight-users@lists.sourceforge.net<mailto:dbp-spotlight-users@lists.sourceforge.net>" <dbp-spotlight-users@lists.sourceforge.net<mailto:dbp-spotlight-users@lists.sourceforge.net>>, DBpediaSpotlight Developers <dbp-spotlight-developers@lists.sourceforge.net<mailto:dbp-spotlight-developers@lists.sourceforge.net>>
Subject: Re: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC co-mentor]




On Thu, Mar 6, 2014 at 6:42 PM, Nicolas Torzec <torzecn@yahoo-inc.com<mailto:torzecn@yahoo-inc.com>> wrote:
Great idea and much needed move ;)

Really good to know that :) We don't have a direct use case for this idea, we just thought it would increase DBpedia usage in big data pipelines

Within the Hadoop platform, the MapReduce framework is focused on distributed batch processing.
Other frameworks are more focused on streaming…
=> Have you considered the pros and cons?

Actually no, this is one of the reasons we need the expert. We have a very general idea of the existing frameworks but cannot make that decision with confidence.

FYI, we are using the DBpedia Extraction framework at Yahoo Labs for some projects, and have been thinking about porting it to Hadoop for some time.
We may be able to help…

Since you (Yahoo Labs) can provide one of our use cases, it could all fit very well.
However, we'd need some commitment. The application period starts next week [1] and if we won't find anyone we'll have to drop this.

Regarding the workflow, we will provide the DBpedia know-how and the "expert" will have two tasks
1) Ensure that the student's application is technically good and, if the students gets accepted
2) periodically (weekly) check his progress during the coding period

Best,
Dimitris

[1] https://www.google-melange.com/gsoc/events/google/gsoc2014


--
Nicolas Torzec
Yahoo Labs


From: Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de<mailto:kontokostas@informatik.uni-leipzig.de>>
Date: Thursday, March 6, 2014 at 5:04 AM
To: "semantic-web@w3.org<mailto:semantic-web@w3.org>" <semantic-web@w3.org<mailto:semantic-web@w3.org>>, Linked Data community <public-lod@w3.org<mailto:public-lod@w3.org>>, DBpedia Discussions <dbpedia-discussion@lists.sourceforge.net<mailto:dbpedia-discussion@lists.sourceforge.net>>, DBpediaDevelopers <dbpedia-developers@lists.sourceforge.net<mailto:dbpedia-developers@lists.sourceforge.net>>, "dbp-spotlight-users@lists.sourceforge.net<mailto:dbp-spotlight-users@lists.sourceforge.net>" <dbp-spotlight-users@lists.sourceforge.net<mailto:dbp-spotlight-users@lists.sourceforge.net>>, DBpediaSpotlight Developers <dbp-spotlight-developers@lists.sourceforge.net<mailto:dbp-spotlight-developers@lists.sourceforge.net>>
Subject: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC co-mentor]

Dear all,

We want to adapt the DBpedia extraction framewok to work with a MapReduce framework. [1]

We want to implement this idea through GSoC 14 and already got two interested students [2] [3].
Unfortunately we are not experienced in this field and our existing contacts could not join. Thus,  we are looking for someone to help us mentor the technical aspects of this project.

About GSoC (http://en.wikipedia.org/wiki/GSoC)
The Google Summer of Code (GSoC) is an annual program, first held from May to August 2005,[1]<http://en.wikipedia.org/wiki/GSoC#cite_note-LinSOC-1> in which Google awards stipends (of US$5,500, as of 2014) to all students who successfully complete a requested free and open-source software coding project during the summer.
See some additional info on our page [4]

Best,
Dimitris

[1] http://wiki.dbpedia.org/gsoc2014/ideas/ExtractionwithMapReduce/
[2] student #1<http://sourceforge.net/mailarchive/forum.php?thread_name=CA%2Bu4%2Ba3g3dSd9L%3DM173hryYPp9HjwtNYgUU6Jcedy9MUAmzMVA%40mail.gmail.com&forum_name=dbpedia-gsoc>
[3] student #2<http://sourceforge.net/p/dbpedia/mailman/dbpedia-gsoc/thread/CAOk94WbB7%2BEzaWveP4OWCGeXvKdVUv790wAL%2BuRsoxTb1VEDeQ%40mail.gmail.com/#msg32063932>
[4] http://wiki.dbpedia.org/gsoc2014?v=kx0#h358-6


--
Dimitris Kontokostas
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org
Homepage:http://aksw.org/DimitrisKontokostas

------------------------------------------------------------------------------
Subversion Kills Productivity. Get off Subversion & Make the Move to Perforce.
With Perforce, you get hassle-free workflows. Merge that actually works.
Faster operations. Version large binaries.  Built-in WAN optimization and the
freedom to use Git, Perforce or both. Make the move to Perforce.
http://pubads.g.doubleclick.net/gampad/clk?id=122218951&iu=/4140/ostg.clktrk
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net<mailto:Dbpedia-discussion@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion




--
Dimitris Kontokostas
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org
Homepage:http://aksw.org/DimitrisKontokostas

Received on Friday, 7 March 2014 19:24:30 UTC