W3C home > Mailing lists > Public > public-vocabs@w3.org > August 2014

Re: tool for extracting structured data in javascript?

From: Barbara Starr <barbarastarr2009@gmail.com>
Date: Sun, 10 Aug 2014 15:39:00 -0700
Cc: Melvin Carvalho <melvincarvalho@gmail.com>, Thad Guidry <thadguidry@gmail.com>, Stéphane Corlosquet <scorlosquet@gmail.com>, "public-vocabs@w3.org Vocabularies" <public-vocabs@w3.org>
Message-Id: <5480F08F-4D21-4250-9C3F-3E2B96264382@gmail.com>
To: Jarno van Driel <jarnovandriel@gmail.com>
Thanks for the correction.

And if there is not honesty, what else is there :)

On Aug 10, 2014, at 3:35 PM, Jarno van Driel <jarnovandriel@gmail.com> wrote:

> quick correction, metaseoinspector isn't by me but by www.omiod.com. I had nothing to do with it (have to be honest, eh).
> 
> 
> 2014-08-11 0:20 GMT+02:00 Barbara Starr <barbarastarr2009@gmail.com>:
> Hi Melvin,
> 
> In theory, you don't need to bloat your page with another js library, all you need to do is use the microdata dom api (a standard for  html5 I believe) specified here:
> 
> http://www.w3.org/TR/microdata/
> 
> and call document.getItems(typeNames) to get all the top level micro data items - Excerpt provided here:
> 
> "The type(s) of items can be obtained using the element.itemType IDL attribute on the element with the itemscope attribute.
> This sample shows how the getItems() method can be used to obtain a list of all the top-level microdata items of a particular type given in the document:
> 
> var cats = document.getItems("http://example.com/feline");
> "
> 
> From there you ought to be able to get the item properties as well
> 
> Not sure how widely this is implemented or adopted as a standard,  however that depends on html5 adoption, but certainly a handy little something to have in one's back pocket, and only 1 function call to get what you need. 
> 
> And just as a heads up, the green turtle tool which you installed (and is a chrome plugin) was listed on the article I provided http://searchengineland.com/see-entities-web-page-tools-help-194710 (it is default rdfa but can handle micro data as well if you enable it  - I thought Ivan Herman was involved in it's development tho).  I do find the visualization has a tendency to die if too many triples are harvested.
> 
> However if you do go the chrome plugin route, the metaseoinspector by Jarno Van Driel (which I find super useful), does tend to behave itself rather well (link below) - and was listed along with green turtle in the link I sent (however specific link also provided here)
> 
> https://chrome.google.com/webstore/detail/meta-seo-inspector/ibkclpciafdglkjkcibmohobjkcfkaef?hl=en
> 
> best regards
> 
> Barbara
> 
> 
> 
> 
> On Aug 10, 2014, at 2:02 PM, Melvin Carvalho <melvincarvalho@gmail.com> wrote:
> 
>> 
>> 
>> 
>> On 10 August 2014 16:42, Thad Guidry <thadguidry@gmail.com> wrote:
>> Melvin,
>> 
>> Here's a list of the current Javascript tools supporting Schema.org on Github (this is a filtered sorted-by-stars view made using Github):
>> 
>> https://github.com/search?l=JavaScript&o=desc&q=schema.org&ref=cmdform&s=stars&type=Repositories
>> 
>> Thanks, that's a great list.  Mhausenblas' project looks very interesting but seems not to have changed for a few years.  I've still yet to find a .js library that's lightweight, but I've got a few places to look now, cheers! :)
>>  
>> 
>> 
>> 
>> On Sun, Aug 10, 2014 at 9:09 AM, Stéphane Corlosquet <scorlosquet@gmail.com> wrote:
>> There is also Alex Milowski's chrome extension for Green Turtle: https://chrome.google.com/webstore/detail/green-turtle-rdfa/loggcajcfkpdeoaeihclldihfefijjam?hl=en (it is schema agnostic: it supports schema.org and any other vocabulary)
>> 
>> 
>> On Sun, Aug 10, 2014 at 9:42 AM, Melvin Carvalho <melvincarvalho@gmail.com> wrote:
>> 
>> 
>> 
>> On 9 August 2014 17:15, Barbara Starr <barbarastarr2009@gmail.com> wrote:
>> Hi Melvin
>> 
>> There are a bunch of handy chrome plugins that do that (if you are a chrome user) and you can see a list here: http://searchengineland.com/see-entities-web-page-tools-help-194710 (micro data reveal, semantic inspector and several others)
>> 
>> Very cool blog post, thanks for sharing!  I'll have to install some of these extensions.
>>  
>> 
>> There is also a rich snippets testing tool that is a bookmarklet, which you can find here: http://www.blindfiveyearold.com/rich-snippets-testing-tool-bookmarklet
>> 
>> That's great, however it requires running it through the google rich snippet server.  I was wondering if there's also an equivalent I could run locally?
>> 
>> Right now it's easy enough to run some jquery and look for $('[itemprop=]') etc. but I was wondering if there was something existing that I could reuse, too?
>>  
>> 
>> I find these tools pretty handy to use as I browse the web.
>> 
>> Hope that sort of covers what you are looking for
>> 
>> regards
>> 
>> Barbara
>> 
>> 
>> On Aug 9, 2014, at 8:02 AM, Melvin Carvalho <melvincarvalho@gmail.com> wrote:
>> 
>>> I was wondering if anyone had a tool to extract structured data from schema.org using javascript.  if there were a bookmarklet for example, that would be very useful.
>>> 
>>> 
>> 
>> 
>> 
>> 
>> 
>> -- 
>> Steph.
>> 
>> 
>> 
>> -- 
>> -Thad
>> +ThadGuidry
>> Thad on LinkedIn
>> 
> 
> 


Received on Sunday, 10 August 2014 22:39:31 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 17:49:34 UTC