Re: Data Licensing from Dan Brickley on 2017-10-26 (public-schemaorg@w3.org from October 2017)

From: Dan Brickley <danbri@google.com>
Date: Thu, 26 Oct 2017 14:59:57 +0200
To: Martin Hepp <mfhepp@gmail.com>
Cc: Clifford Snow <clifford@snowandsnow.us>, Richard Wallis <richard.wallis@dataliberate.com>, "schema.org Mailing List" <public-schemaorg@w3.org>
Message-ID: <CAK-qy=4zorFa5wFkkrodzOF6jYN1kW19dXqf+xOGdOOq1LvMNw@mail.gmail.com>

On 26 October 2017 at 14:47, Martin Hepp <mfhepp@gmail.com> wrote:

> Hi Clifford:
>
> Frankly, you will not be able to get that assurance in any global way. The
> IPR issues with crawling and extracting Web information are non-trivial and
> the reason for several lawsuits and bilateral agreements. Schema.org cannot
> make any such statements, for the sponsors just provide the vocabulary, not
> the data.
>
> See here:
>
>     https://benbernardblog.com/web-scraping-and-crawling-are-perfectly-legal-right/
>
> for a few links.
>
> That is the reason why e.g. republishing data gained from Web crawls is
> problematic in research projects.
>
> One neat idea would be, however, for the sponsors of schema.org to change
> the license of schema.org to a "copyleft" one, i.e. by using schema.org
> on your Web site, you attach a liberal license to your content. Not the
> most friendly move, but maybe one that will save us a lot of trouble in the
> long run.
>

I'm not going to get into theoretical lawyering on this list, but will say
personally that I don't see that as feasible or likely...

(Listmembers who do want to pursue these kinds of ideas might talk to the
POE WG folk at W3C, https://www.w3.org/2016/poe/wiki/Main_Page )

cheers,

Dan

Received on Thursday, 26 October 2017 13:00:32 UTC