W3C home > Mailing lists > Public > public-schemaorg@w3.org > September 2016

Re: sdo Software

From: Thad Guidry <thadguidry@gmail.com>
Date: Fri, 16 Sep 2016 09:54:20 -0500
Message-ID: <CAChbWaOVaZFvQGAV6gAQmtrMOy=RnbYm0QX60Tq4UqL+fBO+dQ@mail.gmail.com>
To: Phil Barker <phil.barker@hw.ac.uk>
Cc: "schema.org Mailing List" <public-schemaorg@w3.org>
Your request piqued my interest more, so did some hunting....found
something cool didn't know existed...

Looks like you can use scrapy-splash to do the heavy lifting of getting a
nice final rendered view to extract the JSON-LD from the <script> containers

https://blog.scrapinghub.com/2015/03/02/handling-javascript-in-scrapy-with-splash/
https://blog.scrapinghub.com/2016/01/19/scrapy-tips-from-the-pros-part-1/

Thad
+ThadGuidry <https://www.google.com/+ThadGuidry>
Received on Friday, 16 September 2016 14:54:58 UTC

This archive was generated by hypermail 2.3.1 : Friday, 16 September 2016 14:54:58 UTC