W3C home > Mailing lists > Public > public-bioschemas@w3.org > September 2018

Re: Worry to many Datasets => spam Was [Re: {Disarmed} Re: DataRecord and Dataset Search]

From: Gray, Alasdair J G <A.J.G.Gray@hw.ac.uk>
Date: Fri, 28 Sep 2018 09:10:06 +0000
To: "public-bioschemas@w3.org" <public-bioschemas@w3.org>
CC: "danbri@google.com" <danbri@google.com>, Vicki Tardif Holland <vtardif@google.com>, Natasha Noy <noy@google.com>
Message-ID: <771F482C-D229-4132-AC79-B99E2842F5A1@hw.ac.uk>

On 28 Sep 2018, at 09:36, Jerven Bolleman <jerven.bolleman@sib.swiss<mailto:jerven.bolleman@sib.swiss>> wrote:

Now that google dataset search exists I have a new worry of over using Dataset.

Take www.uniprot.org<http://www.uniprot.org/> as an example. It has a bit more than a billion webpages. Marking them all up with Dataset for what was a DataRecord before would mean we would have a bit over 3.5 billion Datasets.
Google has no problem with dealing with the volume, but I am worried that their antispam logic/relevance would drown out the 7 or so Datasets that I would like to see highly ranked in their toolbox search.

Jerven, this is a very valid concern and something that is likely to be far more problematic for any tooling we develop as a community than the major search engines.

Considering that most of this work is SEO related, I would vote to mark up just 1 page with DataCatalog/Dataset on www.uniprot.org<http://www.uniprot.org/> and not on the other pages.

Dan, can you give any insight from a search engine perspective?



Alasdair J G Gray
Associate Professor in Computer Science,
School of Mathematical and Computer Sciences
Heriot-Watt University, Edinburgh, UK.

Email: A.J.G.Gray@hw.ac.uk<mailto:A.J.G.Gray@hw.ac.uk>
Web: http://www.macs.hw.ac.uk/~ajg33
ORCID: http://orcid.org/0000-0002-5711-4872
Office: Earl Mountbatten Building 1.39
Twitter: @gray_alasdair


Heriot-Watt University is The Times & The Sunday Times International University of the Year 2018

Founded in 1821, Heriot-Watt is a leader in ideas and solutions. With campuses and students across the entire globe we span the world, delivering innovation and educational excellence in business, engineering, design and the physical, social and life sciences.

This email is generated from the Heriot-Watt University Group, which includes:

  1.  Heriot-Watt University, a Scottish charity registered under number SC000278
  2.  Edinburgh Business School a Charity Registered in Scotland, SC026900. Edinburgh Business School is a company limited by guarantee, registered in Scotland with registered number SC173556 and registered office at Heriot-Watt University Finance Office, Riccarton, Currie, Midlothian, EH14 4AS
  3.  Heriot- Watt Services Limited (Oriam), Scotland's national performance centre for sport. Heriot-Watt Services Limited is a private limited company registered is Scotland with registered number SC271030 and registered office at Research & Enterprise Services Heriot-Watt University, Riccarton, Edinburgh, EH14 4AS.

The contents (including any attachments) are confidential. If you are not the intended recipient of this e-mail, any disclosure, copying, distribution or use of its contents is strictly prohibited, and you should please notify the sender immediately and then delete it (including any attachments) from your system.
Received on Friday, 28 September 2018 09:10:37 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 19:08:06 UTC