- From: Colin Maudry <colin@maudry.com>
- Date: Wed, 15 Jun 2016 19:27:37 +0200
- To: SW-forum <semantic-web@w3.org>
- Message-ID: <57619009.6030709@maudry.com>
Hello Marco, Any multilingual support, or this is just for English? Thanks, Colin On 15/06/16 18:14, Marco Fossati wrote: > To whom it may interest, > > Full of delight, I would like to announce the first beta release of > *StrepHit*: > > https://github.com/Wikidata/StrepHit > > TL;DR: StrepHit is an intelligent reading agent that understands text > and translates it into *referenced* Wikidata statements. > It is a IEG project funded by the Wikimedia Foundation. > > Key features: > -Web spiders to harvest a collection of documents (corpus) from > reliable sources > -automatic corpus analysis to understand the most meaningful verbs > -sentences and semi-structured data extraction > -train a machine learning classifier via crowdsourcing > -*supervised and rule-based fact extraction from text* > -Natural Language Processing utilities > -parallel processing > > You can find all the details here: > https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Validation_via_References > > https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Validation_via_References/Midpoint > > > If you like it, star it on GitHub! > > Best, > > Marco >
Received on Wednesday, 15 June 2016 17:28:04 UTC