Re: A question on RWPM: why the 'metadata' tag?

> On 10 Jan 2018, at 16:34, Benjamin Young <byoung@bigbluehat.com <mailto:byoung@bigbluehat.com>> wrote:
> 
> Then it's terrible JSON-LD. :)

Not terrible. Just erroneous, just like an syntactically incorrect Turtle or badly formed RDF/XML file is unusable as RDF.

> 
> However, practically, a JSON-LD document may contain properties not referenced in the @context, and those would only have value when that document is treated as JSON. There are good reasons to do this sometimes, but generally, you want to make sure the meaningful bits keep their meaning when moved between serializations and storage options.

… but that handling of those (ie, ignoring them) is part of the standard. Ie, the result in RDF is predictable and well defined.

> 
> Quoting from the JSON-LD spec to hopefully clarify:
> "JSON-LD is a concrete RDF syntax as described in [RDF11-CONCEPTS]. Hence, a JSON-LD document is both an RDF document and a JSON document and correspondingly represents an     instance of an RDF data model."
> https://json-ld.org/spec/latest/json-ld/#relationship-to-rdf <https://json-ld.org/spec/latest/json-ld/#relationship-to-rdf>

Ivan

P.S. Just to be very picky: in fact, JSON-LD is a serialization for RDF Datasets (not only single RDF Graphs) and it can also encode "generalized RDF", meaning that properties can be blank nodes (which is not accepted by RDF). But those are well documented exceptions and uninteresting for common mortals:-)



> 
> --
> http://bigbluehat.com/ <http://bigbluehat.com/>
> http://linkedin.com/in/benjaminyoung <http://linkedin.com/in/benjaminyoung>
> From: Leonard Rosenthol <lrosenth@adobe.com <mailto:lrosenth@adobe.com>>
> Sent: Wednesday, January 10, 2018 10:16:13 AM
> To: David Hyland-Wood; Ivan Herman
> Cc: Hadrien Gardeur; W3C Publishing Working Group
> Subject: Re: A question on RWPM: why the 'metadata' tag?
> 
> I would say that your statement is only true *IF* your JSON-LD has been “well-formed” in order to do so – but not all JSON-LD can be treated as RDF.
> 
> From: David Hyland-Wood <david@hyland-wood.org <mailto:david@hyland-wood.org>>
> Date: Wednesday, January 10, 2018 at 4:25 AM
> To: Leonard Rosenthol <lrosenth@adobe.com <mailto:lrosenth@adobe.com>>, Ivan Herman <ivan@w3.org <mailto:ivan@w3.org>>
> Cc: Hadrien Gardeur <hadrien.gardeur@feedbooks.com <mailto:hadrien.gardeur@feedbooks.com>>, W3C Publishing Working Group <public-publ-wg@w3.org <mailto:public-publ-wg@w3.org>>
> Subject: Re: A question on RWPM: why the 'metadata' tag?
> 
> Hi Leonard,
> 
> Yes, I expect an RDF parser to parse JSON-LD and to losslessly transcode JSON-LD into RDF, because JSON-LD can be and should be treated as RDF.
> 
> Regards,
> Dave
> --
> David Hyland-Wood
> http://about.me/david_wood <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fabout.me%2Fdavid_wood&data=02%7C01%7Clrosenth%40adobe.com%7C69c819a32c6e4963c3b708d558254950%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C1%7C636511839536996773&sdata=89K%2FnAbHnrUteo5yb8nGnLT97qKIGu%2FnRt8Mvd6EG%2BE%3D&reserved=0>
> 
> 
> On 10 January 2018 at 08:44:21, Leonard Rosenthol (lrosenth@adobe.com <mailto:lrosenth@adobe.com>) wrote:
> David – do you actually expect an RDF parser (“out of the box”) to parse the JSON-LD?  Or did you mean that there is a defined/standardized way to losslessly transcode the JSON-LD into RDF, which can then be processed?
> 
> Leonard
> 
> From: David Hyland-Wood <david@hyland-wood.org <mailto:david@hyland-wood.org>>
> Date: Tuesday, January 9, 2018 at 1:58 PM
> To: Ivan Herman <ivan@w3.org <mailto:ivan@w3.org>>
> Cc: Hadrien Gardeur <hadrien.gardeur@feedbooks.com <mailto:hadrien.gardeur@feedbooks.com>>, W3C Publishing Working Group <public-publ-wg@w3.org <mailto:public-publ-wg@w3.org>>
> Subject: Re: A question on RWPM: why the 'metadata' tag?
> Resent-From: <public-publ-wg@w3.org <mailto:public-publ-wg@w3.org>>
> Resent-Date: Tuesday, January 9, 2018 at 1:57 PM
> 
> Hi all,  <>
> 
> This was a busy thread overnight my time...
> 
> Getting back to Ivan’s original question,
> I agree 100% that the metadata section should be collapsed, that the blank node should be avoided, and that the JSON-LD should be parsable as RDF for processing in systems that can use that.
> 
> I doubt anyone is surprised by my position ;)
> 
> Regards,
> Dave
> --
> David Hyland-Wood
> http://about.me/david_wood <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fabout.me%2Fdavid_wood&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=9oK7y7Lt%2BXChZXZxPDDcHUaM9vndzueQGtW0BnA0evA%3D&reserved=0>
> 
> 
> On 9 Jan 2018, at 22:17, Ivan Herman <ivan@w3.org <mailto:ivan@w3.org>> wrote:
> 
> Sorry, stupid subject line mistake, s/manifest/metadata/ :-)
> 
> I.
> 
> 
> 
> On 9 Jan 2018, at 13:13, Ivan Herman <ivan@w3.org <mailto:ivan@w3.org>> wrote:
> 
> Hadrien,
> 
> (I did not want to put this as an issue at this moment; it may become relevant if we go down the RWPM way but, until then, this should not yet be an 'official' issue. So let us keep to the ML for now.)
> 
> (My apologies to readers who do not have an RDF background; this is a fairly technical mail…)
> 
> I looked at the opening example in[1]. What I was curious was to see how all this looks like in RDF. I used a converter that generated the following triples (I use Turtle which is much more RDF-like…):
> 
> <<<<
> 
> @prefix ns1: <owl:> .
> @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns# <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=zEamPdtXdj1foXMTSHllwUqJ%2FwE7Mu%2BighUmL3qPzLU%3D&reserved=0>> .
> @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema# <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=FcX54HvhwGQqw8AJtnftVw4FfAf5oZOdKK9%2FJdZGquQ%3D&reserved=0>> .
> @prefix schema: <http://schema.org/ <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fschema.org%2F&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=jsrTvMCWwZ2jhNcChNijM%2BvgrfxB0rwA%2FbQYIlHDk3E%3D&reserved=0>> .
> @prefix xml: <http://www.w3.org/XML/1998/namespace <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2FXML%2F1998%2Fnamespace&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=UmvT5TT0d05%2FU1KLZyh3lM%2F0AS%2F7juVVcgD0VHYKS8c%3D&reserved=0>> .
> @prefix xsd: <http://www.w3.org/2001/XMLSchema# <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2F2001%2FXMLSchema%23&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=y6t5H741iPVR7vZYOwR86iU2Uq76SBJ19OANen%2BwpBY%3D&reserved=0>> .
> 
> <urn:isbn:978031600000X> a schema:Book ;
>   schema:author "Herman Melville" ;
>   schema:dateModified "2015-09-29T17:00:00+00:00"^^xsd:dateTime ;
>   schema:inLanguage "en" ;
>   schema:name "Moby-Dick" .
> 
> [] schema:hasPart [ schema:fileFormat "text/html" ;
>           schema:name "Chapter 2" ;
>           schema:url "c002.html" ],
>       [ schema:fileFormat "text/html" ;
>           schema:name "Chapter 1" ;
>           schema:url "c001.html" ] ;
>   ns1:sameAs <urn:isbn:978031600000X> .
> 
> <<<<
> 
> Which, mostly, looks fine, except for that trick of using owl:sameAs to identify the canonical object (the book) with a blank node. I see several issues with that:
> 
> - Die hard RDF/Linked Data people really try to avoid the usage of blank nodes, because they are a source of constant problems in various RDF related routines, algorithms, etc. There are cases when they are almost necessary (the objects in the schema:hasPart construction above look perfectly fine to me), but the outer blank node (ie, the '[]') would really put many people off.
> 
> - I have not looked at the RDF tool landscape lately, but, afaik, OWL is often ignored by RDF related tools. owl:sameAs _may_ be an exception here and there (there are triple stores that do an owl:sameAs reasoning against their data) but this is not universal. (E.g., the Python RDFLib tool does not do that automatically, you have to use external libraries.) This also means that, e.g., SPARQL requests may fail on querying (in the example above) the "c002.html" part if they only use the ISBN identifier although, semantically, this should be fine.
> 
> - (I am not sure the schema.org <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fschema.org&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=ThMZ%2F0T5Blt8jdOD%2Bct2yx94Fu%2Fh5jA4p6uXOdU5yIo%3D&reserved=0> tools are prepared for this, although that may not be a strong argument)
> 
> So I was trying to get rid of this. The usage of owl:sameAs his is the artefact of mapping the "metadata" term against "owl:sameAs" in the context file. This is necessary because, in the RWPM you do have this separate "metadata" term:
> 
> 
> <<<<
> 
> "metadata" : {
>  "@type": "http://schema.org/Book <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fschema.org%2FBook&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=j1XvEaIf50SkZHbGgG%2BKTUPnBewvKWzFHxTmQMjoSpo%3D&reserved=0>",
>   "title": "Moby-Dick",
>   "author": "Herman Melville",
>   "identifier": "urn:isbn:978031600000X",
>   "language": "en",
>   "modified": "2015-09-29T17:00:00Z"
> }
> 
> <<<<
> 
> and if "metadata" is not mapped in context, it will be ignored (together with the JSON object).
> 
> Hence the question: what does this "metadata" term bring to the table in the first place? Why can't one have the example in [1] be simply
> 
> <<<<
> {
>   "@context": "http://",
>   "identifier": "urn:isbn:978031600000X",
>   "author": "Herman Melville",
>   …
>   "spine": [
>      …
>   ]
>   …
> }
> <<<<
> 
> It strikes me as much more straightforward for authors/users as well. I have used this and got the much more straightforward Turtle output:
> 
> <<<<
> @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns# <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=zEamPdtXdj1foXMTSHllwUqJ%2FwE7Mu%2BighUmL3qPzLU%3D&reserved=0>> .
> @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema# <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=FcX54HvhwGQqw8AJtnftVw4FfAf5oZOdKK9%2FJdZGquQ%3D&reserved=0>> .
> @prefix schema: <http://schema.org/ <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fschema.org%2F&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=jsrTvMCWwZ2jhNcChNijM%2BvgrfxB0rwA%2FbQYIlHDk3E%3D&reserved=0>> .
> @prefix xml: <http://www.w3.org/XML/1998/namespace <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2FXML%2F1998%2Fnamespace&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=UmvT5TT0d05%2FU1KLZyh3lM%2F0AS%2F7juVVcgD0VHYKS8c%3D&reserved=0>> .
> @prefix xsd: <http://www.w3.org/2001/XMLSchema# <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2F2001%2FXMLSchema%23&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=y6t5H741iPVR7vZYOwR86iU2Uq76SBJ19OANen%2BwpBY%3D&reserved=0>> .
> 
> <urn:isbn:978031600000X> a schema:Book ;
>   schema:author "Herman Melville" ;
>   schema:dateModified "2015-09-29T17:00:00+00:00"^^xsd:dateTime ;
>   schema:hasPart [ schema:fileFormat "text/html" ;
>           schema:name "Chapter 2" ;
>           schema:url "c002.html" ],
>       [ schema:fileFormat "text/html" ;
>           schema:name "Chapter 1" ;
>           schema:url "c001.html" ] ;
>   schema:inLanguage "en" ;
>   schema:name "Moby-Dick" .
> <<<<
> 
> I can imagine that there *are* some terms that you do not want to appear in RDF. And that is fine: you already use the trick (e.g., for resources) whereby a term that has no mapping in a JSON-LD context (or is not a URI by itself) is ignored by a JSON-LD processor, ie, you can hide anything you want.
> 
> WDYT?
> 
> Ivan
> 
> P.S. Note, b.t.w., that the JSON-LD 1.1 document[2], which is currently a CG draft, introduces the notion of 'nested properties'[3] which does something similar: it essentially says "ignore this term and the resulting nesting, it is semantically meaningless". Ie, if "metadata" would be defined as "@nest" in the context, we would get the same simplified Turtle. At this moment JSON-LD 1.1 is a CG draft, although there are plans to submit that work as a possible WG at W3C to issue it as a new version of JSON-LD, but that is not yet in motion.
> 
> 
> 
> [1] https://github.com/readium/webpub-manifest <https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Freadium%2Fwebpub-manifest&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=J4J1SCCFxWxljlLuXZt5xw34KwYefDlUC1a1X2CHSrY%3D&reserved=0>
> [2] https://json-ld.org/spec/latest/json-ld <https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fjson-ld.org%2Fspec%2Flatest%2Fjson-ld&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=P%2FgA7lbBzBzH6bZ0a8mLt3VaAoclEZ552h4FDxi9e7Y%3D&reserved=0>
> [3] https://json-ld.org/spec/latest/json-ld/#nested-properties <https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fjson-ld.org%2Fspec%2Flatest%2Fjson-ld%2F%23nested-properties&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=SDSRwhS6%2FXjNFzSpy%2BjEV%2BI3Z3RE7r8yu7sptfwUY5o%3D&reserved=0>
> 
> 
> ----
> Ivan Herman, W3C
> Publishing@W3C Technical Lead
> Home: http://www.w3.org/People/Ivan/ <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2FPeople%2FIvan%2F&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=Rf9hTDVmMiGOzcuhKV5%2Fu54CqKwlkwDRkeViXR9xFg0%3D&reserved=0>
> mobile: +31-641044153
> ORCID ID: http://orcid.org/0000-0003-0782-2704 <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Forcid.org%2F0000-0003-0782-2704&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=6%2BIjziWCxcVKdNLAd2dLkwO%2B5LpYy%2FU%2B63%2F4r%2BiQogM%3D&reserved=0>
> 
> 
> 
> ----
> Ivan Herman, W3C
> Publishing@W3C Technical Lead
> Home: http://www.w3.org/People/Ivan/ <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.w3.org%2FPeople%2FIvan%2F&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=Rf9hTDVmMiGOzcuhKV5%2Fu54CqKwlkwDRkeViXR9xFg0%3D&reserved=0>
> mobile: +31-641044153
> ORCID ID: http://orcid.org/0000-0003-0782-2704 <https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Forcid.org%2F0000-0003-0782-2704&data=02%7C01%7Clrosenth%40adobe.com%7Cd6f52b9dbf1940b42e7208d557ac1a7a%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636511319075263909&sdata=6%2BIjziWCxcVKdNLAd2dLkwO%2B5LpYy%2FU%2B63%2F4r%2BiQogM%3D&reserved=0>

----
Ivan Herman, W3C
Publishing@W3C Technical Lead
Home: http://www.w3.org/People/Ivan/ <http://www.w3.org/People/Ivan/>
mobile: +31-641044153
ORCID ID: http://orcid.org/0000-0003-0782-2704 <http://orcid.org/0000-0003-0782-2704>

Received on Wednesday, 10 January 2018 15:48:17 UTC