W3C home > Mailing lists > Public > public-architypes@w3.org > February 2017

Re: Discussion about previous proposal

From: Owen Stephens <owen@ostephens.com>
Date: Wed, 15 Feb 2017 11:06:09 +0000
Message-Id: <001B57FD-4AD9-47BB-B94C-FF6EEA75F531@ostephens.com>
Cc: public-architypes <public-architypes@w3.org>
To: Richard Wallis <richard.wallis@dataliberate.com>
Thanks Dan and Richard,

I definitely felt when working on the bibextend schema.org group that working from existing representations on the web was a useful starting point.
It probably helps with Adrian’s question as well - what can be done now.

To this end, I’ve put some HTML excerpts from the ArchivesHub (begging Adrian and Jane’s pardon!) on the wiki. I’ve done excerpts for Archive Collections, Sections, and Items both in search results lists and for the full view. This is at https://www.w3.org/community/architypes/wiki/Sample_Archives_HTML_excerpts

I’ve chosen examples from a particular Archive collection picked at random - so not sure if this is “typical” (maybe there isn’t a typical archive!)

I’m hoping that working from this kind of example might help ground this work in practical examples - if others want to add examples from other sources it would give us some good material to start looking at I think


Owen Stephens
Owen Stephens Consulting
Web: http://www.ostephens.com
Email: owen@ostephens.com
Telephone: 0121 288 6936

> On 14 Feb 2017, at 17:17, Richard Wallis <richard.wallis@dataliberate.com> wrote:
> Thank you @danbri for asking the general schema.org-esque questions, it always places, such discussions as this, in contexts.
> Also stepping back to first steps of this, and other Schema.org oriented community groups I have participated in and chair…..
> Much of the structured data shared on the web, in about 30% of pages from millions of domains (a useful overview of a subset can be seen at webdatacommons.org <http://webdatacommons.org/structureddata/2016-10/stats/stats.html>), uses the Shema.org vocabulary.  That data is harvested in to the knowledge graph and similar technologies hosted by the search engines.
> The search engines use this data to drive many services alongside providing SERPs, in fact up until recently it was explicitly stated by some that Schema.org data did not influence SERPs ratings. These other services however are believed to improved the visibility and discoverability on the web.
> Much of the archival resource we are talking about does not have any such data openly published in a way that is easily consumed by the search engines, and is therefore invisible to them.  Consulting the advisory pages provided by the search engines, they advise including structured data using Schema.org in one of its supported serialisation.
> Assuming the above leads you want to publish Schema.org data about your resources, the high level approach I have found from working with several community groups and on consultancy & training engagements is as follows:
> Using Schema.org as it is currently described, try to mark up your resources describing as fully as possible the attributes of those resources that would be of relevance and interest to search engines and the wider web.
> Identify what elements are missing from Schema that are preventing you from doing that.
> Can these missing elements be overcome by enhancements to current terms - adjusting term definitions, recommending the use of a combination of types, expanding the domain and/or range of current properties.
> Finally, propose new terms to extend the vocabulary.
> ~Richard
> Richard Wallis
> Founder, Data Liberate
> http://dataliberate.com <http://dataliberate.com/>
> Linkedin: http://www.linkedin.com/in/richardwallis <http://www.linkedin.com/in/richardwallis>
> Twitter: @rjw
> On 14 February 2017 at 16:34, Dan Brickley <danbri@google.com <mailto:danbri@google.com>> wrote:
> Backing up a bit, and asking some general schema.org-esque questions:
> Are there currently existing public sites that publish this sort of
> structured information? Are there any examples of these (sites +
> typical pages) collected?
> Dan
Received on Wednesday, 15 February 2017 11:06:44 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 8 August 2018 13:28:59 UTC