Re: Are there any datasets about companies? (DBpedia Open Data Initiative)

Sebastian,

Here are a few thoughts:

In Switzerland you can try to scrape http://www.zefix.ch/

Otherwise, the other company databases that I am aware of are under
commercial license.

Swiss commercial data providers are:
http://www.bisnode.de/product/firmendatenbank/
http://ch.kompass.com/

On a worldwide scale, you buy data from Bloomberg and the like.

Besides the known approaches (OpenCorporates, DBpedia, etc.), I am not aware
of another open database.
In general, this is big business for certain company data providers.

Cheers, Daniel


On Tue, Nov 3, 2015 at 4:17 PM, Sebastian Hellmann <
hellmann@informatik.uni-leipzig.de> wrote:

> [Apologies for cross-posting]
>
> Dear all,
> this message is part announcement of an open data initiative and part call
> for feedback and support.
>
> We are considering creating a free, open and interoperable
> dataset on companies and organisations, which we are planning to integrate
> into DBpedia+ and offer as a dump download. As we are in a very early phase
> of the endeavour, we would like to know whether there is existing work in
> this area.
>
> We are looking for any available datasets which have information about
> companies and other organizations in any language and any country. Ideally,
> the datasets are:
> 1. downloadable as dump
> 2. openly licensed, e.g. CC-BY, following the Open Definition:
> http://opendefinition.org/
> 3. in an easily parseable format, e.g. RDF or CSV and not PDF
>
> But hey! Send around anything you know, and we will look at it and see
> whether we can make use of it. You can reach us either by replying to this
> email or by sending feedback directly to me and Kay Müller
> <kay.mueller@informatik.uni-leipzig.de>.
> If you have any private/closed data, please contact us as well. We might
> use it to cross-reference and validate public/open data, or just learn
> from it to build a good schema.
>
> We started a link collection here (and attached the current status at the
> end of this email)
>
> https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit
> We also started to collect potential identifiers for linking here:
>
> https://docs.google.com/spreadsheets/d/1EMqemA1BlqvyOXGLzYbvY0IcBCAhaRd5XgYLMWIxGsA/edit#gid=0
>
> Regards and thank you for any support on this,
> Sebastian and Kay
>
> ##############################
>
>
> https://docs.google.com/document/d/1IaWSSt4_SZVhypvB1QzBlCtBuMQHv-q5Ti0n8xoZFIQ/edit
>
> Open Company Data
>
> Identifiers for companies/organisation
> Table with identifiers:
> https://docs.google.com/spreadsheets/d/1EMqemA1BlqvyOXGLzYbvY0IcBCAhaRd5XgYLMWIxGsA/edit#gid=0
>
> URIs (Linked Data/Semantic Web)
> - DBpedia/Wikipedia/Wikidata URIs - http://dbpedia.org
> - LinkedGeoData - http://linkedgeodata.org/
>
> Downloadable Datasets with Company info (confirmed)
> - VIAF - http://viaf.org/viaf/data/
> - DBpedia - http://downloads.dbpedia.org/current/core/
> - Wikidata - http://downloads.dbpedia.org/current/ext/wikidata/
> - LinkedGeoData - http://downloads.linkedgeodata.org/releases/
> - Company Data Index: http://index.okfn.org/dataset/companies/
> - e.g. UK company data: http://download.companieshouse.gov.uk/en_output.html
>
> Portals with no bulk downloads
> - https://opencorporates.com/
> - http://registries.opencorporates.com/
>
> Portals, we will still need to investigate
> - https://www.wlw.de/
> - https://www.crunchbase.com
> - http://data.crunchbase.com/v3/page/crunchbase-open-data-map-odm
> - http://www.industrystock.de
> - http://www.ebr.org/
> - https://simfin.com/data/browse/companies
> - http://c-lei.org/
> - http://data.imf.org/
> - http://worldbank.270a.info/.html
> - http://datacatalog.worldbank.org/
> - http://www.europages.com/
> - http://www.sec.gov/data
> - http://faculty.philau.edu/russowl/industry.html
> - USA: http://www.corporationwiki.com/
> - India: http://www.companywiki.in/
> - Handelsregister: www.Handelsregister.de
> - Creditreform: http://www.creditsafetrial.com/de/?country=DE
> - Bürgel: https://www.buergel.de/en
> - Factiva: https://global.factiva.com/factivalogin/login.asp?productname=global
>
> Interesting Links:
> - German http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-1/
> - http://get.torial.com/blog/2014/02/die-besten-quellen-fuer-wirtschaftsjournalisten-teil-2/
>
> --
> Sebastian Hellmann
> AKSW/KILT research group
> Institute for Applied Informatics (InfAI) at Leipzig University
> DBpedia Association
> Events:
> * *Nov 20th, 2015* Extended Deadline for Quality Management of Semantic
> Web Assets (Data, Services and Systems)
> <http://www.semantic-web-journal.net/blog/call-papers-special-issue-quality-management-semantic-web-assets-data-services-and-systems>
> Come to Germany as a PhD: http://bis.informatik.uni-leipzig.de/csf
> Projects: http://dbpedia.org, http://nlp2rdf.org,
> http://linguistics.okfn.org, https://www.w3.org/community/ld4lt
> Homepage: http://aksw.org/SebastianHellmann
> Research Group: http://aksw.org
> Thesis:
> http://tinyurl.com/sh-thesis-summary
> http://tinyurl.com/sh-thesis
>



-- 

*Daniel Hladky*, CEO

*Ontos AG*
Mittelstrasse 24, 2560 Nidau, Switzerland
*M* +41 79 3535043
*T* +41 32 3329250
Board of Directors: Daniel Hladky

*Ontos GmbH*
Wurzner Str. 154A, 04318 Leipzig, Germany
*T* +49 341 21559-10
Amtsgericht Leipzig, HRB 25146
Managing Directors: Daniel Hladky, Dr. Martin Voigt

*E* daniel.hladky@ontos.com
*W* www.ontos.com
*_________________________________________________________________________________*

This e-mail (including any attachments) contains confidential information
and may be privileged or otherwise protected from disclosure. The
information contained herein is intended for the use of the intended
addressee only. Please be aware that any disclosure, copying, imitation,
distribution, dissemination or other use of the content of this e-mail,
either by non-intended recipients or by the intended recipient for any
other than the intended purpose of forwarding, is prohibited. If you
received this e-mail in error, please notify the sender immediately by
reply e-mail and delete this e-mail and any attachments from your system.
Thank you for your co-operation. We would also like to inform you that
communication via e-mail over the Internet is insecure because third
parties may have the possibility to access and manipulate e-mails.

Received on Tuesday, 3 November 2015 16:41:01 UTC