W3C home > Mailing lists > Public > public-vocabs@w3.org > August 2014

Re: tool for extracting structured data in javascript?

From: Stéphane Corlosquet <scorlosquet@gmail.com>
Date: Sun, 10 Aug 2014 23:48:54 -0400
Message-ID: <CAGR+nnHPR21m_jtcOVfUYHoc7Pk2FSf5=e8V9yTcHK-Rp24RUg@mail.gmail.com>
To: Barbara Starr <barbarastarr2009@gmail.com>
Cc: Melvin Carvalho <melvincarvalho@gmail.com>, Thad Guidry <thadguidry@gmail.com>, "public-vocabs@w3.org Vocabularies" <public-vocabs@w3.org>
On Sun, Aug 10, 2014 at 6:20 PM, Barbara Starr <barbarastarr2009@gmail.com>
wrote:

> Hi Melvin,
>
> In theory, you don't need to bloat your page with another js library, all
> you need to do is use the microdata dom api
>

just as a heads in case some try to look for this in their browser: the
microdata API is not implemented by any of the popular browsers (at least
I'm not aware of any). It was part of WebKit and Blink for a few months but
was removed. This post
<http://manu.sporny.org/2013/microdata-downward-spiral/> has more details
and links (while the facts in this post are accurate, be warned that it is
biased towards RDFa).

Steph.


> (a standard for  html5 I believe) specified here:
>
> http://www.w3.org/TR/microdata/
>
> and call document.getItems(typeNames)
> <http://www.w3.org/TR/microdata/#dom-document-getitems> to get all the
> top level micro data items - Excerpt provided here:
>
> "The type(s) of items <http://www.w3.org/TR/microdata/#concept-item> can
> be obtained using the element.itemType
> <http://www.w3.org/TR/microdata/#dom-itemtype> IDL attribute on the
> element with the itemscope
> <http://www.w3.org/TR/microdata/#attr-itemscope> attribute.
>
> This sample shows how the getItems()
> <http://www.w3.org/TR/microdata/#dom-document-getitems> method can be
> used to obtain a list of all the top-level microdata items of a particular
> type given in the document:
>
> var cats = document.getItems("http://example.com/feline");
>
> "
>
> From there you ought to be able to get the item properties as well
>
> Not sure how widely this is implemented or adopted as a standard,  however
> that depends on html5 adoption, but certainly a handy little something to
> have in one's back pocket, and only 1 function call to get what you need.
>
> And just as a heads up, the green turtle tool which you installed (and is
> a chrome plugin) was listed on the article I provided
> http://searchengineland.com/see-entities-web-page-tools-help-194710 (it
> is default rdfa but can handle micro data as well if you enable it  - I
> thought Ivan Herman was involved in it's development tho).  I do find the
> visualization has a tendency to die if too many triples are harvested.
>
> However if you do go the chrome plugin route, the metaseoinspector by
> Jarno Van Driel (which I find super useful), does tend to behave itself
> rather well (link below) - and was listed along with green turtle in the
> link I sent (however specific link also provided here)
>
>
> https://chrome.google.com/webstore/detail/meta-seo-inspector/ibkclpciafdglkjkcibmohobjkcfkaef?hl=en
>
> best regards
>
> Barbara
>
>
>
>
> On Aug 10, 2014, at 2:02 PM, Melvin Carvalho <melvincarvalho@gmail.com>
> wrote:
>
>
>
>
> On 10 August 2014 16:42, Thad Guidry <thadguidry@gmail.com> wrote:
>
>> Melvin,
>>
>> Here's a list of the current Javascript tools supporting Schema.org on
>> Github (this is a filtered sorted-by-stars view made using Github):
>>
>>
>> https://github.com/search?l=JavaScript&o=desc&q=schema.org&ref=cmdform&s=stars&type=Repositories
>>
>
> Thanks, that's a great list.  Mhausenblas' project looks very interesting
> but seems not to have changed for a few years.  I've still yet to find a
> .js library that's lightweight, but I've got a few places to look now,
> cheers! :)
>
>
>>
>>
>>
>> On Sun, Aug 10, 2014 at 9:09 AM, Stéphane Corlosquet <
>> scorlosquet@gmail.com> wrote:
>>
>>> There is also Alex Milowski's chrome extension for Green Turtle:
>>> https://chrome.google.com/webstore/detail/green-turtle-rdfa/loggcajcfkpdeoaeihclldihfefijjam?hl=en
>>> (it is schema agnostic: it supports schema.org and any other vocabulary)
>>>
>>>
>>> On Sun, Aug 10, 2014 at 9:42 AM, Melvin Carvalho <
>>> melvincarvalho@gmail.com> wrote:
>>>
>>>>
>>>>
>>>>
>>>> On 9 August 2014 17:15, Barbara Starr <barbarastarr2009@gmail.com>
>>>> wrote:
>>>>
>>>>> Hi Melvin
>>>>>
>>>>> There are a bunch of handy chrome plugins that do that (if you are a
>>>>> chrome user) and you can see a list here:
>>>>> http://searchengineland.com/see-entities-web-page-tools-help-194710 (micro
>>>>> data reveal, semantic inspector and several others)
>>>>>
>>>>
>>>> Very cool blog post, thanks for sharing!  I'll have to install some of
>>>> these extensions.
>>>>
>>>>
>>>>>
>>>>> There is also a rich snippets testing tool that is a bookmarklet,
>>>>> which you can find here:
>>>>> http://www.blindfiveyearold.com/rich-snippets-testing-tool-bookmarklet
>>>>>
>>>>
>>>> That's great, however it requires running it through the google rich
>>>> snippet server.  I was wondering if there's also an equivalent I could run
>>>> locally?
>>>>
>>>> Right now it's easy enough to run some jquery and look for
>>>> $('[itemprop=]') etc. but I was wondering if there was something existing
>>>> that I could reuse, too?
>>>>
>>>>
>>>>>
>>>>> I find these tools pretty handy to use as I browse the web.
>>>>>
>>>>> Hope that sort of covers what you are looking for
>>>>>
>>>>> regards
>>>>>
>>>>> Barbara
>>>>>
>>>>>
>>>>> On Aug 9, 2014, at 8:02 AM, Melvin Carvalho <melvincarvalho@gmail.com>
>>>>> wrote:
>>>>>
>>>>> I was wondering if anyone had a tool to extract structured data from
>>>>> schema.org using javascript.  if there were a bookmarklet for
>>>>> example, that would be very useful.
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>
>>>
>>> --
>>> Steph.
>>>
>>
>>
>>
>> --
>> -Thad
>> +ThadGuidry <https://www.google.com/+ThadGuidry>
>> Thad on LinkedIn <http://www.linkedin.com/in/thadguidry/>
>>
>
>
>


-- 
Steph.
Received on Monday, 11 August 2014 03:49:22 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 17:49:34 UTC