- From: Jerven Bolleman <jerven.bolleman@sib.swiss>
- Date: Fri, 28 Sep 2018 10:36:21 +0200
- To: "Gray, Alasdair J G" <A.J.G.Gray@hw.ac.uk>, public-bioschemas@w3.org
Hi Alasdair, All,

Now that Google Dataset Search exists, I have a new worry about overusing Dataset.

Take www.uniprot.org as an example. It has a bit more than a billion webpages. Marking them all up with Dataset for what was a DataRecord before would mean we would have a bit over 3.5 billion Datasets. Google has no problem dealing with the volume, but I am worried that their antispam logic/relevance would drown out the 7 or so Datasets that I would like to see highly ranked in their toolbox search.

Considering that most of this work is SEO related, I would vote to mark up just 1 page on www.uniprot.org with DataCatalog/Dataset and not the other pages.

A more specific concept would be quite nice. May I suggest using a subtype of schema:StructuredValue, e.g. bioschema:BioChemConcept.

For example, the schema:mainEntity on "https://wormbase.org/species/c_elegans/gene/WBGene00012939" would be of type schema:StructuredValue. In (hand-typed) JSON-LD, roughly this:

{
  "@context": "http://schema.org",
  "@id": "https://wormbase.org/species/c_elegans/gene/WBGene00012939",
  "@type": "WebPage",
  "identifier": "WBGene00012939",
  "mainEntity": {
    "@type": "StructuredValue",
    "name": "subs-4",
    "hasPart": {
      "@type": "PropertyValue",
      "propertyID": "Sequence",
      "value": "Y47D3B.1"
    }
  }
}

Regards,
Jerven

On 09/28/2018 09:37 AM, Gray, Alasdair J G wrote:
> Hi Karen,
>
>> On 27 Sep 2018, at 22:38, Karen Yook <karen@wormbase.org
>> <mailto:karen@wormbase.org>> wrote:
>>
>> I just need to weigh in here as a voice in the Alliance of Genome
>> Resources before anything gets finalized wrt to DataRecord or DataSet.
>> While we are not tied to 'DataRecord' per se, we will need something
>> other than just 'DataSet' to tag our pages.
>
> Can you elaborate on what you mean by, “we will need something other
> than just ‘DataSet’ to tag our pages”?
>
>>
>> We also believe specific distinctions via sub-types perhaps seems to
>> be the preferred way to do things by both schemas.org
>> <http://schemas.org/> and Google. We
>> will try to come up with a more specific proposal by or at the
>> Biohackathon in Paris in a couple weeks.
>
> We would like to get these issues resolved before the hackathon so that
> we can have stable core profiles for use in marking up with resources.
>
> Thanks
>
> Alasdair
>
> --
> Alasdair J G Gray
> Associate Professor in Computer Science,
> School of Mathematical and Computer Sciences
> Heriot-Watt University, Edinburgh, UK.
>
> Email: A.J.G.Gray@hw.ac.uk <mailto:A.J.G.Gray@hw.ac.uk>
> Web: http://www.macs.hw.ac.uk/~ajg33
> ORCID: http://orcid.org/0000-0002-5711-4872
> Office: Earl Mountbatten Building 1.39
> Twitter: @gray_alasdair
Received on Friday, 28 September 2018 08:36:52 UTC