W3C home > Mailing lists > Public > semantic-web@w3.org > March 2014

Re: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC co-mentor]

From: Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de>
Date: Fri, 7 Mar 2014 09:32:46 +0200
Message-ID: <CA+u4+a2m1DCk3aUj=VzEqkAisPn3S+xCXMt+U5eJX=_ZeoREbQ@mail.gmail.com>
To: Nicolas Torzec <torzecn@yahoo-inc.com>
Cc: "semantic-web@w3.org" <semantic-web@w3.org>, Linked Data community <public-lod@w3.org>, "dbpedia-discussion@lists.sourceforge.net" <dbpedia-discussion@lists.sourceforge.net>, DBpediaDevelopers <dbpedia-developers@lists.sourceforge.net>, "dbp-spotlight-users@lists.sourceforge.net" <dbp-spotlight-users@lists.sourceforge.net>, DBpediaSpotlight Developers <dbp-spotlight-developers@lists.sourceforge.net>
On Thu, Mar 6, 2014 at 6:42 PM, Nicolas Torzec <torzecn@yahoo-inc.com>wrote:

>  Great idea and much needed move ;)
>

Really good to know that :) We don't have a direct use case for this idea,
we just thought it would increase DBpedia usage in big data pipelines


>  Within the Hadoop platform, the MapReduce framework is focused on
> distributed batch processing.
> Other frameworks are more focused on streaming…
> => Have you considered the pros and cons?
>

Actually no, this is one of the reasons we need the expert. We have a very
general idea of the existing frameworks but cannot make that decision with
confidence.


> FYI, we are using the DBpedia Extraction framework at Yahoo Labs for some
> projects, and have been thinking about porting it to Hadoop for some time.
> We may be able to help…
>

Since you (Yahoo Labs) can provide one of our use cases, it could all fit
very well.
However, we'd need some commitment. The application period starts next week
[1] and if we won't find anyone we'll have to drop this.

Regarding the workflow, we will provide the DBpedia know-how and the
"expert" will have two tasks
1) Ensure that the student's application is technically good and, if the
students gets accepted
2) periodically (weekly) check his progress during the coding period

Best,
Dimitris

[1] https://www.google-melange.com/gsoc/events/google/gsoc2014


>
>  --
> Nicolas Torzec
> Yahoo Labs
>
>
>   From: Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de>
> Date: Thursday, March 6, 2014 at 5:04 AM
> To: "semantic-web@w3.org" <semantic-web@w3.org>, Linked Data community <
> public-lod@w3.org>, DBpedia Discussions <
> dbpedia-discussion@lists.sourceforge.net>, DBpediaDevelopers <
> dbpedia-developers@lists.sourceforge.net>, "
> dbp-spotlight-users@lists.sourceforge.net" <
> dbp-spotlight-users@lists.sourceforge.net>, DBpediaSpotlight Developers <
> dbp-spotlight-developers@lists.sourceforge.net>
> Subject: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as
> GSoC co-mentor]
>
>   Dear all,
>
>  We want to adapt the DBpedia extraction framewok to work with a
> MapReduce framework. [1]
>
>  We want to implement this idea through GSoC 14 and already got two
> interested students [2] [3].
> Unfortunately we are not experienced in this field and our existing
> contacts could not join. Thus,  we are looking for someone to help us
> mentor the technical aspects of this project.
>
>  About GSoC (http://en.wikipedia.org/wiki/GSoC)
>  The *Google Summer of Code* (*GSoC*) is an annual program, first held
> from May to August 2005,[1]<http://en.wikipedia.org/wiki/GSoC#cite_note-LinSOC-1> in
> which Google awards stipends (of US$5,500, as of 2014) to all students
> who successfully complete a requested free and open-source software coding
> project during the summer.
>  See some additional info on our page [4]
>
>  Best,
> Dimitris
>
> [1] http://wiki.dbpedia.org/gsoc2014/ideas/ExtractionwithMapReduce/
> [2] student #1<http://sourceforge.net/mailarchive/forum.php?thread_name=CA%2Bu4%2Ba3g3dSd9L%3DM173hryYPp9HjwtNYgUU6Jcedy9MUAmzMVA%40mail.gmail.com&forum_name=dbpedia-gsoc>
> [3] student #2<http://sourceforge.net/p/dbpedia/mailman/dbpedia-gsoc/thread/CAOk94WbB7%2BEzaWveP4OWCGeXvKdVUv790wAL%2BuRsoxTb1VEDeQ%40mail.gmail.com/#msg32063932>
> [4] http://wiki.dbpedia.org/gsoc2014?v=kx0#h358-6
>
>
>  --
> Dimitris Kontokostas
> Department of Computer Science, University of Leipzig
> Research Group: http://aksw.org
> Homepage:http://aksw.org/DimitrisKontokostas
>
>
> ------------------------------------------------------------------------------
> Subversion Kills Productivity. Get off Subversion & Make the Move to
> Perforce.
> With Perforce, you get hassle-free workflows. Merge that actually works.
> Faster operations. Version large binaries.  Built-in WAN optimization and
> the
> freedom to use Git, Perforce or both. Make the move to Perforce.
>
> http://pubads.g.doubleclick.net/gampad/clk?id=122218951&iu=/4140/ostg.clktrk
> _______________________________________________
> Dbpedia-discussion mailing list
> Dbpedia-discussion@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>


-- 
Dimitris Kontokostas
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org
Homepage:http://aksw.org/DimitrisKontokostas
Received on Friday, 7 March 2014 07:33:42 UTC

This archive was generated by hypermail 2.4.0 : Tuesday, 5 July 2022 08:45:36 UTC