- From: William Van Woensel <william.vanwoensel@gmail.com>
- Date: Sat, 28 Sep 2024 14:39:04 -0400
- To: Thomas Lörtsch <tl@rat.io>
- Cc: Niklas Lindström <lindstream@gmail.com>, Andy Seaborne <andy@apache.org>, RDF-star Working Group <public-rdf-star-wg@w3.org>
- Message-Id: <01DA2229-8C1D-476C-B0F0-82058511B515@gmail.com>
This is wholly removed from the current syntax (and I'm hesitant to propose it for that purpose). But I do anyway, if only for the reason that it solves a few issues that have been bugging me. It follows from thinking about what kind of meaning (I think) people associate with a symbol, such as "?" and "!". It uses "?" for triple terms when we don't want to assert the triple - the "?" reflecting some uncertainty they have about the triple. It uses "!" for triple terms when we do want to assert the triple - the "!" reflecting certainty about the triple. E.g., for reified triples (reifiedTriple): [? :employee38 :jobTitle "Assistant Designer" ?] :accordingTo :employee22 . or, <<? :employee38 :jobTitle "Assistant Designer" ?>> :accordingTo :employee22 . For annotations: [! :employee38 :jobTitle "Assistant Designer" !] :accordingTo :employee22 . or, <<! :employee38 :jobTitle "Assistant Designer" !>> :accordingTo :employee22 . (i.e., the syntax applies to the triple term, not the metadata) This could solve the following issues: - Currently, the annotation syntax applies to the _metadata_ of the triple term; reified triple syntax applies to the _triple term_. IMO, reading the Turtle syntax - after not having followed it closely for a few years - I got a bit confused at first for that reason. (Well, to be honest, I was glossing over it and making assumptions for annotations based on reified triples.) The above syntax annotates the triple term itself for both cases, which removes this dichotomy. On the other hand, I found that the annotation syntax does allow writing less "disjointed" code - see gist below. - It frees up the standalone << and >> so it can be used for triple terms (tripleTerm) instead of the (in my opinion, horrible) <<( and )>> . - Reifier terms can now be uniformly indicated to as follows: [? ~ _:id1 :employee38 :jobTitle "Assistant Designer" ?] :accordingTo :employee22 . [! ~ _:id1 :employee38 :jobTitle "Assistant Designer" !] :accordingTo :employee22 . Or <<? ~ _:id1 :employee38 :jobTitle "Assistant Designer" ?>> :accordingTo :employee22 . <<! ~ _:id1 :employee38 :jobTitle "Assistant Designer" !>> :accordingTo :employee22 . (or, using "id" as in N3 IRI property lists <https://w3c.github.io/N3/spec/#iriprplist>) I don't think "!" and "?" are legal in PN_PREFIX; if not, there's no ambiguity there. The problem with "?" is that it is already used for variables in N3; there would be a need for a whitespace after an opening [? . A drawback is that one can no longer write statements such as in Niklas's examples: <http://www.wikidata.org/entity/Q34851> a sdo:Person ; rdfs:label "Elizabeth Taylor"@lb , sdo:hasOccupation <http://www.wikidata.org/entity/Q12362622> , <http://www.wikidata.org/entity/Q3282637> ~ s:Q34851-09B35D31-83D6-49DA-9615-12E025DD1016 {| a :Circumstance ; :reference [ :inferredFrom_P3452 <http://www.wikidata.org/entity/Q6455247> ] , [ :statedIn_P248 <http://www.wikidata.org/entity/Q54862624> ] |} . I.e., where you can easily attach annotations to different objects in an object list. (But honestly, this syntax confused me a bit when I first saw it as well.) Indeed, after converting the example below, I noticed that it leads to code that is much more disjointed than Niklas' original example. It is also much more tricky to convert Niklas' examples into this syntax :-) It took me a bit and I had to use Jena to parse the original code; AFAICT Jena does not support reifier IDs nor multiple chained annotations (i.e., :s :p :o {| ... |} {| ... |}) so I had to remove these from the example. https://gist.github.com/william-vw/ce67c98486d10c1e38901f8c6ab15be9 Apologies if someone has already suggested it and I missed it. W > On Sep 28, 2024, at 8:58 AM, William Van Woensel <william.vanwoensel@gmail.com> wrote: > > FWIW, my personal view - the unified pipe syntax indeed looks a bit confusing in large examples. Which part is the identifier, and which is the annotation? IMO it is better to have a dedicated symbol for particular purposes, such as the ~ for reifier terms. > > When used individually, my issue with the pipe operator is its "baggage" - it is known and used for a different purpose. But, I think Thomas makes a good point with [ ] being used for adding details, such as an editor's note; and that { } should be reserved for graphs. > > Instead of the "|" symbol, perhaps adding a "?" (i.e., [? xyz ?]) could be more suitable. To me, the symbol conveys something like "what more can be said about this statement? well...". E.g., > > ex:Ioannes_68 a crm:E21_Person , > ex:Gender_Eunuch ~ ex:Gender_Assignment_Eunuch [? a crm:E17_Type_Assignment ; > crm:P14_carried_out_by ex:Paphlagonian_family ; > rdfs:label "Castration gender assignment" ?] ; > rdfs:label "John the Orphanotrophos" . > > I don't think it would clash with the Turtle grammar, but, it could clash with the N3 variable syntax (well, not if we require a whitespace after the "?"). On a related note, the potential "baggage" of this symbol is its association with variables. > >> There has been long discussions about the current syntax in github issues. No one will be happy about everything in syntax discussions. > > Sorry to be adding to it. This option may have already come up; if so, feel free to disregard. > > > W > >> On Sep 27, 2024, at 3:43 PM, Thomas Lörtsch <tl@rat.io> wrote: >> >> Hi Niklas, >> >> thank you for the links! I agree that these are indeed large examples, and thank you for the effort. Still, and I know that I do sound like a measly know-all when I say this, they are still very few ;-) But it’s unreasonable to expect us to get much further with example data (and if we did, it would still not be sure that we could evaluate them properly). Syntax is in a lot of ways a matter of taste and intuition. IMO it’s important to try to stick to some principles and seemingly objective criteria, however without getting hung up on those too much ;-) >> >>> On 23. Sep 2024, at 17:00, Niklas Lindström <lindstream@gmail.com> wrote: >>> >>> Yes, I tried out syntax variants on the UCR examples, plus a larger >>> example based on the full Wikidata description about Elizabet Taylor >>> (complete with nominations, awards, spouses and nationalities (e.g. >>> twice a US citizen)). For illustration, I just added a new gist with >>> those updated to the new syntax: >>> >>> https://gist.github.com/niklasl/c0ba767efe4816a515ad04a4db48b3e6 >> >> Very nice! I just converted them to my proposal from [7]: >> Liz: https://gist.github.com/rat10/ddfd60afb42a8062fd7f1680ebedd022 >> UCR: https://gist.github.com/rat10/6c66e360c36b7d81bb3b9bc21fc16b96 >> >> The good news is: this is relatively easy to do :) >> >> The bad news is: this reads not particularily well. In the current version (i.e. in yout gist linked above) annotation syntax seem visually better discernible from standard triples. The cost however, especially that it uses curly braces which should be reserved to graphs, is IMO too high. Seems to me like more thinking and tinkering is needed… >> >> However, the difference is more pronounced in the Elizabeth Taylor example which is also in the current syntax too involved to be really readable, especially because of those excrutiatingly long identifiers. Some line breaks would certainly help but I couldn’t figure out how to introduce them automatically in a sufficiently nice way (i.e. with proper indentation). >> >>> (One caveat is the last UCR example using a full list in a triple >>> occurrence; also mentioned in [1].) >> >> Uff. I’ll comment on that in the issue itself. >> >>> (The now obsolete examples I linked to from the comments on either the >>> original github issue [2] or the addressing PR [3] are at [4] and [5}. >>> Of note in [2] is that pipe collided with SPARQL alternativePath in >>> annotations; which this change fixed.) >> >> I just read through [3] again and noticed a comment by Andy saying that "If we go postfix, then '~' vs '|' is pure choice" [6]. If that is indeed correct (I guess it hasn’t been tested thoroughly as the discussion from that point on favored the tilde) then it’s good to know. Aesthetically I find the tilde quite okay. However, I also have that urge to unify the syntactic variations, as outlined in [7], and in that respect the pipe seems better. >> >> Best, >> Thomas >> >>> >>> Best regards, >>> Niklas >>> >>> [1]: <https://github.com/w3c/rdf-turtle/issues/71#issuecomment-2363703036> >>> [2]: <https://github.com/w3c/rdf-star-wg/issues/116> >>> [3]: <https://github.com/w3c/rdf-turtle/pull/51> >>> [4]: <https://gist.github.com/niklasl/c23925f831950506fde4eb73885319cd> >>> [5]: <https://gist.github.com/niklasl/1845c6bc8b37402cc9698720c2e22f88> >> [6] https://github.com/w3c/rdf-turtle/pull/51#issuecomment-2256850306 >> [7] https://lists.w3.org/Archives/Public/public-rdf-star-wg/2024Sep/0073.html >>> >>> >>> On Fri, Sep 20, 2024 at 12:37 PM Andy Seaborne <andy@apache.org> wrote: >>>> >>>> >>>> >>>> On 20/09/2024 09:46, Thomas Lörtsch wrote: >>>>> this is one of your typical "arguments": seems to look so wise, but is so vacuous all the same. if you think you know something that can only be seen in large examples, then show it or at least describe it in some detail. don't expect everybody to just believe in your wisdom >>>> >>>> There have been examples done by Niklas on the visual impact of syntax >>>> designs. >>>> >>>> >> >> >
Received on Saturday, 28 September 2024 18:39:22 UTC