- From: Young,Jeff (OR) <jyoung@oclc.org>
- Date: Mon, 10 Aug 2015 15:28:27 +0000
- To: "LeVan,Ralph" <levan@oclc.org>, Richard Wallis <richard.wallis@dataliberate.com>, "public-schemabibex@w3.org" <public-schemabibex@w3.org>
- Message-ID: <BLUPR06MB1290E060A8F09A59312ADA9AD700@BLUPR06MB129.namprd06.prod.outlook.com>
I made an argument that the problem is broader than bib records: https://github.com/schemaorg/schemaorg/issues/700#issuecomment-129078302 Limiting to our situation, though, Richard cites the count from WorldCat at 72 million “agents” (people and organizations excluded): https://github.com/schemaorg/schemaorg/issues/700#issuecomment-129227478 These all have Linked Data identifiers, but they are only mechanized placeholders in need of exposure, reconciliation, and enrichment. The danger of not sorting these out is that naïve automated “entity matching” processes resort to string matching on name as an “else condition” and the resulting mix up manifests itself in the Linked Data. I suggested Google Custom Search as a possible tool to help with discovery and possibly lead to an interface where they could be reconciled: https://github.com/schemaorg/schemaorg/issues/700#issuecomment-129239474 Jeff From: LeVan,Ralph Sent: Monday, August 10, 2015 10:33 AM To: Young,Jeff (OR); Richard Wallis; public-schemabibex@w3.org Subject: RE: The Agent proposal in bib.schema.org is controversial One of the arguments against Agent was that if you didn’t know what kind of object a thing was, then you just shouldn’t say. All the properties of Agent seem to come from Thing. I’d propose that we just use Thing. My guess is that the need for Agent comes mostly from our need to convert existing bib records into RDF and some of our crappy old bib records don’t reliably distinguish the type of agent involved. Rather than be caught out in a lie about whether the agent is a Person or Organization, we’d rather say less. This is a problem peculiar to our situation and not a broad problem of the internet community. It’s also a short-term problem. Selling ‘Agent’ to a community that doesn’t need it is going to be an uphill battle. What’s wrong with dropping all the way back to Thing when we don’t know the type of the agent? Ralph From: Young,Jeff (OR) [mailto:jyoung@oclc.org] Sent: Monday, August 10, 2015 10:04 AM To: Richard Wallis; public-schemabibex@w3.org<mailto:public-schemabibex@w3.org> Subject: RE: The Agent proposal in bib.schema.org is controversial One option would be for us to use foaf:Agent. Presumably search engines would ignore it, but that’s their prerogative. Another option would be to preserve http://bibliograph.net/Agent, with a comment that it wasn’t accepted by the broader community, but remains useful in our limited domain. (Terms that have been adopted should be deprecated.) Jeff From: Richard Wallis [mailto:richard.wallis@dataliberate.com] Sent: Monday, August 10, 2015 8:18 AM To: public-schemabibex@w3.org<mailto:public-schemabibex@w3.org> Subject: The Agent proposal in bib.schema.org is controversial You may have noticed if you followed the recent announcement of Schema.or v2.1<https://lists.w3.org/Archives/Public/public-schemabibex/2015Aug/0000.html>, which includes bib.schema.org<http://bib.schema.org>, that one of our proposals did not make it in. That proposal being the Agent type that we proposed as a super-type for Person and Organization. Agent has been a theme of discussion in the community well before we approached the issue. You can follow the recent debate in the related schemaorg git issue comment trail: https://github.com/schemaorg/schemaorg/issues/700 In the bibliographic world Agent is a well understood, some would say obvious, approach. When applied to the wider domains that Schema.org embraces however, it raises many concerns and issues. Especially because, as proposed, it would introduce a new direct sub-type of Thing with ramifications that could cascade across many areas of the vocabulary. In my personal opinion the gap between the two apposing views on this is significant and the best way forward would be to consider possible pragmatic approaches to how we represent our data in Schema.org without loosing the ability to describe our resources effectively to the wider world. In simple terms, if we identify an author, creator, publisher, or even copyright holder as a Person or an Organization there is not a problem. The difficulty occurs when we know from the relationships in the data that they are either a Person or an Organization but cannot identify which. One suggested way forward for such a circumstance would be to define them as a schema:Thing. To me this feels a little too vague. A follow-on option was to suggest a 'personOrOrganization' boolean property to indicate this circumstance. This is a little more appealing, but I think it still needs some work. What are others thoughts on this? Do we believe that the proposed Agent type is the only way forward? Are there potential pragmatic options like the one I describe above that we could shape, that would be acceptable? Is this requirement to specifically describe agents as too detailed and something we can pass over, and move on to other things? ~Richard. Richard Wallis Founder, Data Liberate http://dataliberate.com Linkedin: http://www.linkedin.com/in/richardwallis Twitter: @rjw
Received on Monday, 10 August 2015 15:29:10 UTC