Re: Data Licensing

Hi Clifford:

Frankly, you will not be able to get that assurance in any global way. The IPR issues with crawling and extracting Web information are non-trivial and reason for several lawsuits and bilateral agreements. Schema.org cannot make any such statements, for the sponsors just provide the vocabulary, not the data.

See here:

    https://benbernardblog.com/web-scraping-and-crawling-are-perfectly-legal-right/

for a few links.

That is the reason why e.g. republishing data gained from Web crawls is problematic in research projects.

One neat idea would be, however, for the sponsors of schema.org to change the license of schema.org to a "copyleft" one, i.e. by using schema.org on your Web site, you attache a liberal license to your content. Not the most friendly move, but maybe one that will save us a lot of trouble in the long run.

Best

Martin

Disclaimer: This is not legal advice, just a pointer to potential problems.

-----------------------------------
martin hepp  http://www.heppnetz.de
mhepp@computer.org          @mfhepp




> On 26 Oct 2017, at 15:31, Clifford Snow <clifford@snowandsnow.us> wrote:
> 
> Dan,
> You are correct, I'm interested in the content from the schema, not the actual schema. It seems obvious, at least to me, that the website owners want the data used and redistributed. But OSM takes a conservative approach to licensing. I was looking for a statement that says something like "...grants you the right to use and redistribute the data..." when the schema is lacking a licence property. 
> 
> Clifford
> 
> On Thu, Oct 26, 2017 at 2:47 AM, Dan Brickley <danbri@google.com> wrote:
> Just to add: it sounded to me like Clifford is interested in actual content of sites and data sources that use schema.org.
> 
> That complex topic goes beyond the scope of our project. Schema.org markup in email messages, or in public Web pages, for example are quite different. Robots.txt is important for the latter...
> 
> Dan
> 
> On 26 Oct 2017 06:20, "Richard Wallis" <richard.wallis@dataliberate.com> wrote:
> Hi Clifford,
> 
> I believe this FAQ should help:
> 
> Q: Under what terms can we re-use this documentation (and schemas, examples, software)?
> As noted in our terms of service document, schema.org schemas are made available under the Creative Commons Attribution-ShareAlike License (version 3.0). These terms also apply to the supporting documentation on this site and on our blog, as well as to the markup examples used in this site. The software used for our site is also available for opensource re-use under an Apache 2 license. If you have feedback on these terms please get in touch.
> 
> ~Richard
> 
> 
> 
> Richard Wallis
> Founder, Data Liberate
> http://dataliberate.com
> Linkedin: http://www.linkedin.com/in/richardwallis
> Twitter: @rjw
> 
> On 26 October 2017 at 04:45, Clifford Snow <clifford@snowandsnow.us> wrote:
> I've been looking over the schema.org website trying to find out how the data is licensed. It seems like it should be public domain or something similar.  There is a property "license" which is shown as being used by 1000 to 10,000 domains. Is there documentation that websites using the schema agree to a license?
> 
> There is strong interest by some in the OpenStreetMap community that it would make sense to get schema data when adding a local business. Because OSM can not accept copyright material, we can not pull in data that violates our policies. 
> 
> Having the ability to pull in data would help us improve the quality and populate more OSM tags, such as addresses and opening hours. 
> 
> Thanks in advance,
> Clifford
> 
> -- 
> @osm_seattle
> osm_seattle.snowandsnow.us
> OpenStreetMap: Maps with a human touch
> 
> 
> 
> 
> -- 
> @osm_seattle
> osm_seattle.snowandsnow.us
> OpenStreetMap: Maps with a human touch

Received on Thursday, 26 October 2017 12:47:35 UTC