- From: AZ <antoine.zimmermann@emse.fr>
- Date: Tue, 21 Jul 2020 19:36:23 +0200
- To: "Peter F. Patel-Schneider" <pfpschneider@gmail.com>, semantic-web@w3.org
Little complement:
Le 21/07/2020 à 19:15, Antoine Zimmermann a écrit :
> Le 21/07/2020 à 17:38, Peter F. Patel-Schneider a écrit :
>> I looked over the datatype specification.
>>
>> I was puzzled by the relationship between the general datatype and the
>> specific datatypes. It would seem to me that the denotation of "5
>> m"^^cdt:length should be the same as "5 m"^^^cdt:ucum but the former is a
>> length as defined by ISQ and the latter is a pair consisting of the real
>> number 5 and meter.
The previous version of the spec was:
https://ci.mines-stetienne.fr/lindt/v2/custom_datatypes
The definitions of the value space and lexical-to-value mappings were
not very precise, and not in line with what the UCUM spec says. UCUM has
a set of base units that is not the same as the base units of the SI
standard.
The new definition is more precise and aligned with UCUM semantics.
>
> The specific datatypes are actually not really stable and will be
> externalised in a future version. They were added because the
> implementation of UCUM we are using have specific support for all these,
> so we initially thought we would assign IRIs to them. But this set is
> rather arbitrary, so we will make them optional and extensible. The
> specification of these types is based on a former draft and we did not
> update them because we were planning to remove them from the core
> specification.
>
>>
>> As well, why are real numbers used when only decimals can be written?
In fact, this is not correct. In UCUM, the constant π (noted "[pi]" in
UCUM) is defined. It can be used as a multiplier to any unit and its use
may result in non decimal values in the (value,baseunit) pairs. Of
course, a lot of real numbers are still impossible to write, but it is
much more convenient to define it this way.
--AZ
>
> What problem does it bring? The value space of owl:real contains all the
> real numbers and has an empty lexical space. Maybe I'm missing something
> and this has weird consequences, but I do not see them. >
>
> --AZ
>
>
>>
>>
>> peter
>>
>>
>> On 7/21/20 8:35 AM, Antoine Zimmermann wrote:
>>> Regarding physical quantities, such as "5 inches", etc., my colleague
>>> Maxime
>>> Lefrançois and myself coauthored a specification for a datatype for
>>> physical
>>> quantities [1]. It is quite simple: we reuse the Unified Code for
>>> Units of
>>> Measurement (UCUM), a standard that is used in many scientific
>>> applications,
>>> and combine it with a number:
>>>
>>> <QUANTITY> ::= <NUMBER> <SPACES> <UCUMCODE>
>>> <NUMBER> ::= xsd:decimal(('e'|'E')xsd:integer)?
>>>
>>> Since UCUM has a well defined semantics, so does our datatype.
>>> Better, since
>>> UCUM is implemented in many programming languages, my colleague
>>> Maxime could
>>> easily integrate it into Jena and its SPARQL engine [2].
>>>
>>> So, with our Jena fork, one can write:
>>>
>>> SELECT ?planet WHERE {
>>> ?planet a ex:Planet;
>>> ex:diameter ?s .
>>> FILTER(?s > "2e11 mm"^^cdt:ucum)
>>> }
>>>
>>> This works if the size of the planet is encoded as a cdt:ucum, no matter
>>> what unit one is using. One can even use "link for Gunter's chain" (unit
>>> "[lk_us]"), or "cubic meters per acre" (unit "m3/[acr_us]") [3],
>>> which are
>>> both units of length.
>>>
>>> With some of our industrial partners, we are using this for energy
>>> data, and
>>> they seem to be very pleased with this approach, compared to an
>>> ontology-based approach.
>>>
>>>
>>> [1] https://w3id.org/lindt/custom_datatypes#ucum
>>> [2] You can try it at
>>> https://ci.mines-stetienne.fr/lindt/playground.html
>>> [3] Try this query in the playground:
>>>
>>> """
>>> PREFIX iter: <http://w3id.org/sparql-generate/iter/>
>>> PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
>>> PREFIX cdt: <http://w3id.org/lindt/custom_datatypes#>
>>> PREFIX ex: <http://example.org/>
>>>
>>> SELECT ?length ?normalized
>>>
>>> WHERE{
>>>
>>> VALUES ?position { "2.7e3 m3/[acr_us]"^^cdt:ucum }
>>> # convert to meters
>>> BIND("0 m"^^cdt:ucum + ?position AS ?normalized )
>>>
>>> }
>>> """
>>>
>>> --AZ
>>>
>>> Le 17/07/2020 à 01:57, Cox, Simon (L&W, Clayton) a écrit :
>>>> Yeah, the atomicity of the chunk is the point. This even applies to
>>>> quantities. 25.4mm is *identical* to 1” – they are the same thing. Any
>>>> engine that operates with quantities needs to understand that.
>>>> ’25.4’ and
>>>> ‘mm’ cannot be separated. Coordinates are slightly more complex but it
>>>> comes down to the same thing. A single element within a set of
>>>> coordinates
>>>> that describes a position in space is not independent of the other
>>>> numbers
>>>> in the tuple, or of the coordinate reference system within which
>>>> they are
>>>> expressed. One value should *never* be used independent of the others.
>>>> Exactly the same position on the earth will be denoted by three
>>>> different
>>>> numbers if embedded in a different coordinate reference system. You can
>>>> only ‘reason’ over them as a group, not individually.
>>>>
>>>> *From:*Dan Brickley <danbri@danbri.org>
>>>> *Sent:* Thursday, 16 July, 2020 23:58
>>>> *To:* Jeen Broekstra <jeen@fastmail.com>
>>>> *Cc:* Semantic Web <semantic-web@w3.org>
>>>> *Subject:* Re: Blank nodes must DIE! [ was Re: Blank nodes semantics -
>>>> existential variables?]
>>>>
>>>> …
>>>>
>>>> I believe the big appeal of putting it all into the zone we call
>>>> "literals"
>>>> is that you get a kind of atomicity; that chunk of data is either
>>>> there, or
>>>> not there; it is asserted, or not asserted. With a triples-based
>>>> (description of a ) data structure you have to be constantly on your
>>>> guard
>>>> that every subset of the full graph pattern is at least sensible and
>>>> harmless, even when subsetting these chunks is often confusing or
>>>> misleading for data consumers. I can't help wondering whether
>>>> notions of
>>>> graph shapes from shacl, shex (and sparql) could be exploited to
>>>> create an
>>>> RDF-based data format which had atomicity at the level of entire
>>>> shapes.
>>>>
>>>> Dan
>>>>
>>>> Jeen
>>>>
>>>
>>
>
>
--
Antoine Zimmermann
ISCOD / LSTI - Institut Henri Fayol
École Nationale Supérieure des Mines de Saint-Étienne
158 cours Fauriel
42023 Saint-Étienne Cedex 2
France
Tél:+33(0)4 77 42 66 03
Fax:+33(0)4 77 42 66 66
http://zimmer.aprilfoolsreview.com/
Received on Tuesday, 21 July 2020 17:36:41 UTC