W3C home > Mailing lists > Public > public-lod@w3.org > November 2015

Re: Are there any datasets about companies? ( DBpedia Open Data Initiative)

From: Giovanni Tummarello <g.tummarello@gmail.com>
Date: Tue, 3 Nov 2015 18:30:09 -0300
Message-ID: <CAHHRs7hNAOKuXLwtVURvV21kip1MAAB-jatZkN_rcvbeef38=w@mail.gmail.com>
To: Sebastian Hellmann <hellmann@informatik.uni-leipzig.de>
Cc: public-lod <public-lod@w3.org>, Kay Müller <kay.mueller@informatik.uni-leipzig.de>, Daniel Hladky <daniel.hladky@ontos.com>, Nandana Mihindukulasooriya <nmihindu@fi.upm.es>
Hi Sebastian, just for context

(i am collaborating with a leadingmarket data provider) there are 17 M+
organizations in italy alone (either alive or dead .. but maybe worth still
being in a database).

Maaaybe, just maaybe its worth tto talk to some of these organization and
campaign the opening up of a super minimal dataset e.g. just name,
registration city, status dead or alive.

The rationale is that they could receive more hits to get all the "rest of
the data" from paying customers.

but it will be quite difficult one has to come up with a good pitch, and a
lot of patience. Consider that permid seems to have one such super open
dataset so maybe that's a starting point.

Self catered "add your company" approaches, are not going to work in my
opinion.

Gio

On Tue, Nov 3, 2015 at 2:05 PM, Nandana Mihindukulasooriya <
nmihindu@fi.upm.es> wrote:

> Hi Sebastian,
>
> Open PermID and Open Calais [1,2] initiatives from Thomson Reuters with
> Linked Data + bulk download (CC-BY 4.0) might be of interest to your
> work. Brian Ulicny presented it in ISWC 2015 [3] and it has identifiers
> curated and maintained by Thomson Reuters for more than 3.5 million
> organizations .
>
> It also has several useful information about those organizations.
> http://tinyurl.com/permid-org-properties
> http://tinyurl.com/permid-triple-patterns
>
> Best Regards,
> Nandana
>
> [1] https://permid.org/faq
> [2] http://www.opencalais.com/about/
> [3] https://twitter.com/nandanamihindu/status/653232796874506240
>
> On Tue, Nov 3, 2015 at 4:17 PM, Sebastian Hellmann <
> hellmann@informatik.uni-leipzig.de> wrote:
>
>> [Apologies for cross-posting]
>>
>> Dear all,
>> this message is part announcement of an open data initiative and part
>> call for feedback and support.
>>
>> We are considering to work on creating a free, open and interoperable
>> dataset on companies and organisations, which we are planing to integrate
>> into DBpedia+ and offer as dump download. As we are in a very early phase
>> of the endeavour, we would like to know whether there is existing work in
>> this area.
>>
>> We are looking for any available datasets which have information about
>> companies and other organizations in any language and any country. Ideally,
>> the datasets are:
>> 1. downloadable as dump
>> 2. openly licensed , e.g. CC-BY following the
>> <http://opendefinition.org/>http://opendefinition.org/
>> 3. in an easily parseable format, e.g. RDF or CSV and not PDF
>>
>> But hey! Send around anything you know, and we will look at it and see
>> whether we can make use of it. You can reach us either by replying  to this
>> email or send feedback directly to me and Kay Müller
>> <kay.mueller@informatik.uni-leipzig.de>
>> <kay.mueller@informatik.uni-leipzig.de>.
>> If you have any private/closed data, please contact us as well. We might
>> make use of it to cross-reference and validate public/open data with it. Or
>> just learn from it to build a good scheme.
>>
>> We started a link collection here (and attached the current status at the
>> end of this email)
>>
>> https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit
>> Also we started to collect potential identifiers for linking here:
>>
>> https://docs.google.com/spreadsheets/d/1EMqemA1BlqvyOXGLzYbvY0IcBCAhaRd5XgYLMWIxGsA/edit#gid=0
>>
>> Regards and thank you for any support on this,
>> Sebastian and Kay
>>
>> ##############################
>>
>>
>> https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> * Open Company Data Open Company Data
>> <https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit#heading=h.buuo7dypfd9a>
>> Identifiers for companies/organisation
>> <https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit#heading=h.qs150ivpio94>
>> URIs (Linked Data/Semantic Web)
>> <https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit#heading=h.b9yeovqjeglz>
>> Downloadable Datasets with Company info (confirmed)
>> <https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit#heading=h.7ihxrlrypp14>
>> Portals with no bulk downloads
>> <https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit#heading=h.a95o85lqil72>
>> Portals, we will still need to investigate
>> <https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit#heading=h.p50bjh96q3ok>
>> Identifiers for companies/organisation Table with identifiers:
>> <https://docs.google.com/spreadsheets/d/1EMqemA1BlqvyOXGLzYbvY0IcBCAhaRd5XgYLMWIxGsA/edit#gid=0>https://docs.google.com/spreadsheets/d/1EMqemA1BlqvyOXGLzYbvY0IcBCAhaRd5XgYLMWIxGsA/edit#gid=0
>> <https://docs.google.com/spreadsheets/d/1EMqemA1BlqvyOXGLzYbvY0IcBCAhaRd5XgYLMWIxGsA/edit#gid=0>
>> URIs (Linked Data/Semantic Web) - DBpedia/Wikipedia/Wikidata URIs -
>> <http://dbpedia.org>http://dbpedia.org <http://dbpedia.org> - LinkedGeoData
>> - <http://linkedgeodata.org/>http://linkedgeodata.org/
>> <http://linkedgeodata.org/> Downloadable Datasets with Company info
>> (confirmed) - VIAF - <http://viaf.org/viaf/data/>http://viaf.org/viaf/data/
>> <http://viaf.org/viaf/data/> - DBpedia -
>> <http://downloads.dbpedia.org/current/core/>http://downloads.dbpedia.org/current/core/
>> <http://downloads.dbpedia.org/current/core/> - Wikidata -
>> <http://downloads.dbpedia.org/current/ext/wikidata/>http://downloads.dbpedia.org/current/ext/wikidata/
>> <http://downloads.dbpedia.org/current/ext/wikidata/> - LinkedGeoData -
>> <http://downloads.linkedgeodata.org/releases/>http://downloads.linkedgeodata.org/releases/
>> <http://downloads.linkedgeodata.org/releases/> - Company Data Index:
>> <http://index.okfn.org/dataset/companies/>http://index.okfn.org/dataset/companies/
>> <http://index.okfn.org/dataset/companies/> - e.g. UK company data:
>> <http://download.companieshouse.gov.uk/en_output.html>http://download.companieshouse.gov.uk/en_output.html
>> <http://download.companieshouse.gov.uk/en_output.html> Portals with no bulk
>> downloads - <https://opencorporates.com/>https://opencorporates.com/
>> <https://opencorporates.com/> -
>> <http://registries.opencorporates.com/>http://registries.opencorporates.com/
>> <http://registries.opencorporates.com/> Portals, we will still need to
>> investigate - <https://www.wlw.de/>https://www.wlw.de/
>> <https://www.wlw.de/> -
>> <https://www.crunchbase.com>https://www.crunchbase.com
>> <https://www.crunchbase.com> -
>> <http://data.crunchbase.com/v3/page/crunchbase-open-data-map-odm>http://data.crunchbase.com/v3/page/crunchbase-open-data-map-odm
>> <http://data.crunchbase.com/v3/page/crunchbase-open-data-map-odm> -
>> <http://www.industrystock.de>http://www.industrystock.de
>> <http://www.industrystock.de> - <http://www.ebr.org/>http://www.ebr.org/
>> <http://www.ebr.org/> -
>> <https://simfin.com/data/browse/companies>https://simfin.com/data/browse/companies
>> <https://simfin.com/data/browse/companies> -
>> <http://c-lei.org/>http://c-lei.org/ <http://c-lei.org/> -
>> <http://data.imf.org/>http://data.imf.org/ <http://data.imf.org/> -
>> <http://worldbank.270a.info/.html>http://worldbank.270a.info/.html
>> <http://worldbank.270a.info/.html> -
>> <http://datacatalog.worldbank.org/>http://datacatalog.worldbank.org/
>> <http://datacatalog.worldbank.org/> -
>> <http://www.europages.com/>http://www.europages.com/
>> <http://www.europages.com/> -
>> <http://www.sec.gov/data>http://www.sec.gov/data <http://www.sec.gov/data>
>> -
>> <http://faculty.philau.edu/russowl/industry.html>http://faculty.philau.edu/russowl/industry.html
>> <http://faculty.philau.edu/russowl/industry.html> - USA:
>> http://www.corporationwiki.com/ <http://www.corporationwiki.com/> - India:
>> http://www.companywiki.in/ <http://www.companywiki.in/> - Handelsregister:
>> www.Handelsregister.de <http://www.Handelsregister.de> - Creditreform:
>> http://www.creditsafetrial.com/de/?country=DE
>> <http://www.creditsafetrial.com/de/?country=DE> - Bürgel:
>> https://www.buergel.de/en <https://www.buergel.de/en> - Factiva:
>> https://global.factiva.com/factivalogin/login.asp?productname=global
>> <https://global.factiva.com/factivalogin/login.asp?productname=global> -
>> Interesting Links: - German
>> <http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-1/>http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-1/
>> <http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-1/>
>> -
>> <http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-2/>http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-2/
>> <http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-2/>
>> *
>>
>> --
>> Sebastian Hellmann
>> AKSW/KILT research group
>> Insitute for Applied Informatics (InfAI) at Leipzig University
>> DBpedia Association
>> Events:
>> * *Nov 20th, 2015* Extended Deadline for Quality Management of Semantic
>> Web Assets (Data, Services and Systems)
>> <http://www.semantic-web-journal.net/blog/call-papers-special-issue-quality-management-semantic-web-assets-data-services-and-systems>
>> Venha para a Alemanha como PhD:
>> <http://bis.informatik.uni-leipzig.de/csf>
>> http://bis.informatik.uni-leipzig.de/csf
>> Projects: http://dbpedia.org, http://nlp2rdf.org,
>> <http://linguistics.okfn.org>http://linguistics.okfn.org,
>> https://www.w3.org/community/ld4lt <http://www.w3.org/community/ld4lt>
>> Homepage: http://aksw.org/SebastianHellmann
>> Research Group: http://aksw.org
>> Thesis:
>> http://tinyurl.com/sh-thesis-summary
>> http://tinyurl.com/sh-thesis
>>
>
>
Received on Tuesday, 3 November 2015 21:31:02 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:22:27 UTC