W3C home > Mailing lists > Public > public-sdw-wg@w3.org > January 2017

Re: ACTION-249: Further work on BP8 (geometry)

From: Andrea Perego <andrea.perego@jrc.ec.europa.eu>
Date: Wed, 18 Jan 2017 12:17:46 +0100
To: Ed Parsons <eparsons@google.com>
Cc: Linda van den Brink <l.vandenbrink@geonovum.nl>, W3C SDW WG - Public <public-sdw-wg@w3.org>, Andreas Harth <harth@kit.edu>, Joshua Lieberman <jlieberman@tumblingwalls.com>
Message-id: <6c385ee2-7943-9200-3302-d3bb98918041@jrc.ec.europa.eu>
Many thanks, Ed.

About point (3), I think one of the question is whether it would be 
anyway useful to provide examples of existing approaches and the 
relevant use cases.

For instance, for bboxes, we can show how this is done with schema:box 
in Web pages and locn:geometry for GeoDCAT-AP metadata. GeoDCAT-AP 
offers also an example of how to publish geometries in multiple formats 
(WKT, GML, GeoJSON) along with the relevant dataset (so, not as 
independent resources).

About the information to be included in the RDF representation of 
geometries, we already have a number of examples.

For instance, Ordnance Survey basically includes just the geometry - 
see, e.g.:

http://data.ordnancesurvey.co.uk/doc/geometry/37256-10

whereas GADM-RDF provides also a description of the available 
serialisations:

http://gadm.geovocab.org/id/0/60/geometry

I think this is a useful information: by retrieving the RDF 
representation I know whether my preferred serialisation format is 
supported and how I can retrieve it.

I'm just thinking aloud, but other possible options may be:
- include the CRS, bbox, centroid, etc.
- provide an RDF description including the most used vocabularies to 
facilitate re-use.

I'm actually not aware of any operational system doing the latter, but I 
have been experimenting this in a prototype. The resulting RDF 
representation of a point geometry is shown here:

https://github.com/andrea-perego/geoiri/blob/master/examples/point(0_0).rdf

This basically includes:
- links to and descriptions of the available serialisations
- the point represented with GeoSPARQL (WKT+GML), Basic Geo, Schema.org, 
geo URI scheme and GeoHash, all connected with owl:sameAs to the main 
geometry URI

Taking the example above, my understanding is that publishing RDF 
representations of geometries includes, but it is not limited to,
filling the gaps / inconsistencies in existing RDF vocabularies, and may 
require approaches to promote re-usability analogous to the idea of 
providing geometries in multiple serialisations.

Cheers,

Andrea


On 18/01/2017 10:46, Ed Parsons wrote:
> Andrea,
>
> Thanks for your thoughts very useful, where you do suggest approaches in
> 1 and 2 I'm +1 As for 3 where we are less mature in our thinking and
> available approaches we need to just explain the situation and likely
> developments to resolve the issues e.g. Josh's work in the near future...
>
> Ed
>
>
> On Wed, 18 Jan 2017 at 10:22 Andrea Perego
> <andrea.perego@jrc.ec.europa.eu <mailto:andrea.perego@jrc.ec.europa.eu>>
> wrote:
>
>     Dear Linda, all,
>
>     In view of the relevant agenda item of today's call [1], I provide below
>     a summary of the discussions we had so far on how to publish geometries
>     on the Web. Apologies in advance for the long email.
>
>     @All, I kindly ask you to check if what reported here is correct. Any
>     comments & revisions are more than welcome.
>
>
>     1. Preferred geometry format(s)
>
>     BP8 includes already some guidelines, but based on some discussion
>     during the last f2f [2,3], it seems that more explicit recommendations
>     would be desirable. What follows is a tentative contribution based on
>     what I recall from our discussions.
>
>     The reference BP here is the general DWBP B14 principle of providing
>     data in multiple formats: https://www.w3.org/TR/dwbp/#MultipleFormats
>
>     Applied to geometries, this should ideally imply providing geometries in
>     the most used serialisations. However, this may not be always feasible,
>     so it is important to identify one or more preferred geometry
>     serialisations. One of the requirements here is that such serialisations
>     should be preferably Web-friendly. More importantly, we don't want to be
>     prescriptive and prevent people from publishing geometries in their
>     preferred serialisations. So, the recommendation should sound like:
>
>     "Publish geometries in any serialisation you like, but for re-usability
>     it is important that you make them available in format X [, Y, Z, ...]."
>
>     Most on the discussions we had on "preferred format(s)" were about two
>     possible candidates: WKT and GeoJSON. Both are widely used and
>     supported. GeoJSON is the most webby one, but also WKT is supported by
>     popular Web libraries (as OpenLayers and Leaflet). Moreover, WKT is also
>     supported by most triple stores - even those not supporting GeoSPARQL.
>
>     The main drawbacks with GeoJSON seem to be related to the fact that in
>     its current version it supports only one CRS - namely, CRS84 (i.e.,
>     WGS84, with lon/lat axis order) - see:
>     https://tools.ietf.org/html/rfc7946#section-4
>
>     WKT doesn't have this problem, and has other advantages - it's a very
>     compact literal form compared to GeoJSON (as well as other geometry
>     serialisations), it's case insensitive, and it has a corresponding
>     binary encoding (WKB).
>
>     There are however a number of issues:
>     - WKT is available in a number of flavours - e.g., the original WKT
>     format, the extended variant supported in PostGIS (EWKT) [2], the
>     GeoSPARQL variant
>     - The axis order is implemented inconsistently. For instance, in
>     PostGIS, by default it's lon/lat, irrespective of the CRS, whereas
>     GeoSPARQL requires the use of the axis order specified in the CRS
>
>     It has been pointed out that GML does not have the issues above, since
>     both CRS and axis order can be explicitly specified. However, my
>     understanding of the relevant discussion is that GML is not considered
>     webby enough, and it has limited support in Web / LD applications, tools
>     and platforms.
>
>     Trying to come to a conclusion, this is my personal understanding:
>
>     (a) We cannot avoid recommending GeoJSON as (one of) the preferred
>     geometry serialisation(s), because of its widespread use and support on
>     the Web. But with the caveat that it may not be suitable for all use
>     cases, due to the CRS issue.
>
>     (b) We need also a geometry serialisation not having the GeoJSON issues.
>     Between WKT and GML, the former seems to be definitely more suitable for
>     Web and LD applications. But in this case we need to decide which
>     variant should be recommended, and the rule about the axis order.
>
>
>     2. How to publish geometries on the Web
>
>     This point is of course related to the principle of "publishing
>     geometries for Web use", but also to the idea of making geometries a
>     "first-class citizen" on the Web.
>
>     I think that issue here boils done to whether geometries should be
>     published along with the relevant spatial things, or independently.
>
>     There was some discussion in the last f2f [2,3] about the two options of
>     denoting geometries with blank nodes or URIs. Linda provided an example
>     from Ireland, where the rationale about using blank nodes is that the
>     data provider would like people to link to their spatial things, and not
>     to the geometries.
>
>      From this perspective, using URIs for geometries should be based on use
>     cases where people would like to link instead to the geometry itself.
>     Which, I think, is basically related to the question whether / in which
>     cases a geometry is "re-usable".
>
>     A possible use case is when you need to link to some "authoritative"
>     geometry - e.g., an administrative boundary maintained by an
>     institutional agency. Using the relevant URI would ensure not only that
>     I'm referring always to the official and up to date version of the
>     geometry, but I implicitly provide provenance information.
>
>     This is not different from linking to and re-using data maintained by
>     external organisations - e.g., as in the work illustrated by Bart in
>     Lisbon, where fire depts. re-use cadastral data not by copying them
>     locally, but linking to them.
>
>     So, IMO, in BP8 we should mention both approaches, clarifying the
>     different use cases they are addressing. And we can also mention that,
>     depending on the solution used, the publication of geometries in
>     multiple serialisations is different - e.g., for geometry URIs, HTTP
>     conneg can be used.
>
>
>     3. Geometries in RDF
>
>     The main points of discussion seem have been focussed on the following
>     topics:
>
>     (a) Recommended vocabularies and/or best practices for using them
>
>     (b) Which information should be included in the RDF representation of a
>     geometry
>
>     In general, both relate to Josh's work on the revision of GeoSPARQL.
>     However, as far as existing vocabularies are concerned, my understanding
>     is that the only consolidated agreement we have is the use of Basic Geo
>     for point geometries. For other geometry types, bboxes, centroids, etc.
>     we suggest a number of options. The question is whether this is enough,
>     or we should instead provide some more specific recommendations. I can
>     try to collect a number of examples from the reference vocabularies, if
>     this may be helpful.
>
>     About point (b), there was some discussion during the last f2f [2,3],
>     but I'm not sure an agreement was reached. One point quite controversial
>     from the very beginning is whether the RDF representation should include
>     the CRS separately from the geometry specification (this can be very
>     much dependent on how a geometry is modelled). Another issue was about
>     linking a geometry to related geometries (which seems to imply the use
>     of URIs for geometries).
>
>     I think here it would be crucial to have real-world examples as a
>     starting point, and possibly suggest how the can be improved.
>
>
>     Thanks, and sorry again for the long mail.
>
>     Meet you later
>
>     Andrea
>
>     ----
>     [1]https://lists.w3.org/Archives/Public/public-sdw-wg/2017Jan/0067.html
>     [2]https://www.w3.org/2016/12/15-sdw-minutes
>     [3]https://www.w3.org/2016/12/16-sdw-minutes
>     [4]http://postgis.net/docs/ST_GeomFromEWKT.html
>
>     --
>     Andrea Perego, Ph.D.
>     Scientific / Technical Project Officer
>     European Commission DG JRC
>     Directorate B - Growth and Innovation
>     Unit B6 - Digital Economy
>     Via E. Fermi, 2749 - TP 262
>     21027 Ispra VA, Italy
>
>     https://ec.europa.eu/jrc/
>
>     ----
>     The views expressed are purely those of the writer and may
>     not in any circumstances be regarded as stating an official
>     position of the European Commission.
>
> --
>
> *Ed Parsons *FRGS
> Geospatial Technologist, Google
>
> Google Voice +44 (0)20 7881 4501 <tel:%2B44%20%280%2920%207881%204501>
> www.edparsons.com <http://www.edparsons.com/> @edparsons
>

-- 
Andrea Perego, Ph.D.
Scientific / Technical Project Officer
European Commission DG JRC
Directorate B - Growth and Innovation
Unit B6 - Digital Economy
Via E. Fermi, 2749 - TP 262
21027 Ispra VA, Italy

https://ec.europa.eu/jrc/

----
The views expressed are purely those of the writer and may
not in any circumstances be regarded as stating an official
position of the European Commission.
Received on Wednesday, 18 January 2017 11:17:39 UTC

This archive was generated by hypermail 2.4.0 : Thursday, 24 March 2022 20:31:28 UTC