- From: Phil Archer <phila@w3.org>
- Date: Wed, 10 Feb 2016 11:59:00 +0000
- To: SDW WG Public List <public-sdw-wg@w3.org>
Many of those involved in the F2F Monday-Tuesday this week are here for
a third day at the PiLOD/EuroSDR meeting. We held a breakout session
with potential audience for the BP doc. Both Andrea and I made some
notes of the occasion, including recording some new issues.
http://www.pilod.nl/wiki/Spatial_Data_on_The_Web_Event_10_February_2016
Notes are online at
https://www.w3.org/2016/02/10-sdw-minutes
Text version below.
[1] http://www.w3.org/
F2F Discussion in Amersfoort with EuroSDR, PiLOD, etc.
10 Feb 2016
Attendees
Present
Linda, Andrea, Ed, Jeremy, PhilA, ChrisLittle,
10_guests, BillRoberts, Bart
Regrets
Chair
Jeremy
Scribe
phila
Contents
* [2]Topics
* [3]Summary of Action Items
* [4]Summary of Resolutions
__________________________________________________________
This is a rough record of the discussion held at the
[5]EuroSDR/PiLOD meeting in Amersfoort right after F2F 3. It
was an occasion to meet some of the potential audience for the
BP doc. It is not an official WG meeting of course but did
raise some issues that were added to the traker.
[5]
http://www.pilod.nl/wiki/Spatial_Data_on_The_Web_Event_10_February_2016
Andrea's Notes
MEETING STARTS
Jeremy: The SDW WG was introduced in the plenary session. We collected s
ome topics that may be relevant for discussion ...
... sw architecures used ...
... which formats, which types ...
... what prevents you from using spatial data in your organisations ...
... we have been trying to structures the BPs around a specific use case
s. If you have real examples to share, please do ...
... tell us what is missing.
Chris: OGC changed the policy about who is the target public, and also n
on-members can influence the process.
Jeremy: Maybe we should do a tour-de-table to know who's here.
Linda: Work at Geonovum, and involved in SDW WG
Ed: Co-chairing the SDW WG, work at Google
??: GIS consultant and specialist. I would like to contribute to the BPs
by experimenting and testing
??: Want to learn at Linked Data, and know how a book will look like in
the future
??: Focus on publishing data in a way people can understand. Work on one
of the Geonovum testbed
??: From Norway, work with OGC standard, but also sensor web. Currently
more focusses on WFS. Find it difficult to navigate from a complex XML f
ile by using tools users use.
Chris: Meteorologist at MetOffice. Interest in how to make 4-dimensional
data
??: Work at Geonovum and very interested in what you're doing
??: PhD student at IGN France, work on linking geo data on the Web
??: From IGN France, very curious on the BPs, especially wrt spatio-temp
oral data, multidimensional data. This approach may bring new opportunit
ies wrt the traditional approach.
??: Web cartographer and developer. Work for Amsterdam municipality, peo
ple knows everything on geo data, but not much on scripting and APIs.
??: Worked on integrating information from different perspective, coveri
ng geo but not only - e.g., CAD. Interested to see how all these "spatia
l" aspects can be brought together.
Phil: From W3C, coordinating the SDW WG.
Jeremy: Work at UK's MetOffice.
Paul van Genuchten: From Geocat, taking care of GeoNetwork, we are integ
rating a lot of webby things. Participating in the testbed.
Jeremy: BP is just one of the deliverables.
Ed: This is actually the 2nd deliverable, but gives the contex to what w
e have been doing. The first one was the UCRs, edited by Frans and Aleja
ndro...
... the other deliverables focussed on specific ontologies - Time, SSN,
Coverages (any data in time and space, very much used in the GIS world).
Jeremy: About the scope...
... three ways of interacting: being a member of the group, drop a mail
to the mailing list, use GH to submit issues ...
... there are 5 themes, reflecting how we think about the problem space.
Ed: One of the challenges we are trying to solve is to identify those BP
s that can be easily implemented, and those that can be considered advan
ced features.
Jeremy: The first problem is the assignment of identifiers. There's some
discussion on whether identifiers can be assigned to real-world things
or only to information resources. We try to give some advise on how to a
ssign IDs to things...
... but to be actionable on the Web these IDs need to be HTTP URIs.
Jeremy: We are also trying to explain why you have to do it, and which a
re the possible advantages...
... the point is that the BPs must be durable. It is difficult to unders
tand whether or not to give people technical guidance, since in the near
future these technologies may be out of date
??: Why not putting BPs in a menu, ranked wrt the 5-star criterion.
Jeremy: That's a good idea. We also recognise that people start from dif
ferent positions, so we may need to reflect this in the BPs.
??: Yes, but if the content of the BPs is made somehow visible (e.g., in
a visual way) this can help people pick what they need.
Jeremy: Another good point.
Jeremy: But do you want concrete guidance, or not?
Ed: We can have both: an abstract description, complemented with an exam
ple on how this can be implemented by using a given technology.
(Many): Agree, concrete examples are needed
??: I recall a wiki having two sections, one that is fixed, and cannot b
e changed, and another one that is kind of working document.
Jeremy: But which platforms the examples should be taken from?
Paul van Genuchten: On of the issues is that people may want to do thing
s, but they are short of fundings. It may be worth addressing the policy
level, and not only the technical one.
Ed, Jeremy, Phil: agree
[gap]
Jeremy: So, you should provide examples of incremental approaches.
Jeremy: One of the options is about re-using authoritative IDs.
??: There is a possible disadvantange depending on who owns the data.
Jeremy: True, there are situations where you cannot / don't want to re-u
se existing data.
Jeremy: Another option is to re-use your local IDs...
... we give guidance on how to operate persistent IDs ...
... IDs for data slices / subsets ...
Paul van Genuchten: Why the last is not under DWBP? Is there something t
ypically spatial?
Jeremy: This is indeed a general requirement, but the truth is that ther
e's no mechanism defined for other data that can be re-used for spatial
data.
??: We had to tackle with large objects (dataset items).
Jeremy: We have a section dealing with large data, but not on specific t
ypes (e.g., geometry)
Jeremy: Another BP referes to an issue mentioned during the plenary is t
hat there are way too many ontologies, but eventually people don't use t
hem, and prefer JSON/GeoJSON.
Ed: There's an open issue about not being to much focussed on RDF. We ar
e not going to say that RDF is the only solution.
Jeremy: Another BP is about providing the right level of complexity and
the right encoding depending on the use case / application. And in some
cases you don't need to provide a precise geometry (e.g., the boundaris
of the American West).
Jeremy: Another issue is about CRSs. A lot of Web people don't (want to)
know about CRSs...
... Another point is about relative positions ...
... positional accuracy.
??: [explaining the use case about ability to refer multiple representat
ions to the same resource] ...
... example is a building, described in XML, in a LD way, as a polygon o
n a map a CAD project. People relates them as the same thing, but this i
s not the case, because they are describing the same thing from differen
t perspectives.
Jeremy: [moving on] Related points are make links visible on the Web, ex
press relationship, making data indexable by search engine, link to the
real things and not to their representations.
??: Do you link to other BPs.
Ed: Yes, the principle is not reinvent.
??: We should also prevent geo people from doing some things. People sti
ll thinks at geo data as maps, and, e.g., place textual labels as spatia
l objects themselves.
Jeremy: [issue added]
??: As a data publisher, you don't know who's using your data, for which
use cases. It might be good to point to solutions that can be used to e
xpose data in a way able to address the most common applications.
Paul van Genuchten: I have a problem with the word "publisher". This may
give people the wrong idea that LD is only about data publication, wher
eas this should affect also how data are maintained internally (e.g., th
is applies to IDs / URIs).
??: Use case missing [missed it]
Linda: [addressing people from IGN] Do you use LD?
??: Just for research work, at the moment.
??: We are actually publishing the administrative boundaries of France.
It's experimental, currently.
Jeremy: You can then help testing the BPs.
??: Accuracy, vagueness are key issues for us.
END OF MEETING
Phil's Notes
<scribe> Agenda: Ad Hoc
scribe; phila
<scribe> scribe: phila
Tour de table shows guests are mostly developers looking for
guidance
Jeremy: Show the We Need You slide with some questions we're
looking to answer
... Shows the BP doc [6]http://w3c.github.io/sdw/bp/
[6] http://w3c.github.io/sdw/bp/
Ed: Gives overview of the other deliverables of the WG
JT: Goes through the document.
... Begins with the audience section - who is this for
... Describes the Issue flags etc.
... Issues link to GH Issues
... Or can write to the mailing list
... Describes the 5 themes seen in the BP doc
Ed: Talks about incremental steps from small to 5* etc.
JT: Theme 1 - assignment of identifiers
... Gets dangerously close to HR-14
... So doc tries to give advice on what you should give
identifiers to.
... Describes the structure of a BP
... Talks about difference between intended outcome and
possible approaches to implementation.
... Do you want concrete implementation - which may go out of
date - or do you want "create this outcome" but with less
guidance.
Guest: In publication, we make a lot of use of flow charts.
... Maybe put in a chart with the different star levels and
assign BPs to each one.
JT: That might work as it shows you where to start.
... We also have to think about different starting points,
someone with an SDI cf. a Web Dev
Guest: When I got the book about Linked Data, I looked at the
index and decided where I was going to start.
... A flow chart would have been helpful.
JT: Refers to DWBP which has used a visual method for
navigation.
Ed: An abstract description cf. more practical
Guest2: Talks about low hanging fruit and bookmarking pages
etc. The longer term vision is useful.
Guest3: I think you need both. You need more proven technology
section and maybe a future-looking approach.
... A wiki in the airline industry where there are very strict
regulations, and there's a draft section, and the sections are
clearly delineated with colours etc.
Guest4: I think you need concrete examples for each BP.
Ed: We do have a clear internal vision that we want to help
implementers to grab bits of code and start using it.
Guest5: If you're talking about code you need to talk about
platforms etc.
Ed: Yes, and we recognise that any choice will be out of date
in time.
Guest5: There is no one choice of today. JSON(-LD)? works on
all platforms, it's a data structure, but be careful of
choosing a development platform.
JT: If we were to choose a platform...
phila: W3C/Web Platform etc.
Guest5: I don't see examples
JT: Not yet...
... I want to come back to choice of platform. Do you want to
see them in etrms of OGC services, or Python snippets.
Guest: We have traditional data providers, like governments,
that we like to tickle to share their data on the Web. Only a
minority will be tickled.
... There's the other groups that need regulation from above.
... You dopn't always target the data provider, you target the
one above the provier to encourage them to push for change.
... You may need a differnet documentation approach.
Ed: You're saying it's more about policy and context?
... More the business case?
Guest: yes.
JT: Something that a manager can pass on
Ed: It can be more challenging to make the case for LD to
policy makers who have spent a lot of money building SDIs.
Andrea: Another point might be - how can I implement this as a
layer on top of the existing structure? What are the
incremental steps? - useful?
Guest: I say from a tech POV, everything is possible.
... I find a layer on top of a layer may not be the best way.
Andrea: I was thinking of alternatives, that you can start from
scratch, or slowly add bits.
Guest: Our experience is that companies like incremental steps
that show benefits. And LD allows you to start small.
... You can guide comapnies towards this practice rather than
enforcing it from above. benefits win over force.
Ed: There's a perspective we haven't communicated - beyond the
SDI world, there is data on the Web that is not explicitly
spatial that we can make explicitly spatial.
... We can say, if you're just building Web pages, how do you
add a little bit of structured data to make location
understandable.
JT: So we want BPs that say if you do this little thing, you'll
see that benefit.
... BP2 is about pointing people to reuse existing
authoritative identifiers. NL has URIs for dykes, buildings
etc.
... reuse geonames, there are lots of IDs there. Network
Effect.
ChrisLittle: We now have a target of no more than 4 links
(facebook thing)
JT: There are situations where you can't use somebody else's
identifier, but where you can, you should.
Chris: The first star, the licensing side - that's hard. We now
have 2 categories: open data and open data that is managed -
which you need to use responsibly.
JT: We say that you probably have IDs all over the place, but
they're local ones. How can we help you create URIs from those
local IDs.
... BP4 - changing over time, when tyo mint a new ID.
... Guidance on giving identifiers to larger parts of info.
... You may only want a smaller subset.
... This sparked a lot of debate (see
[7]https://lists.w3.org/Archives/Public/public-sdw-comments/201
5Dec/0000.html onwards)
[7]
https://lists.w3.org/Archives/Public/public-sdw-comments/2015Dec/0000.html
Guest: Why is this is the SDW, not DWBP?
JT: Because it came up looking at large spatial
datasets/coverages. DWBP didn't come up against it and don't
have an answer.
Ed: The heirarchy of places is relevant as well. One place is
within another. Looking more now at placial, not spatial.
Guest: This is relevant to such a large audience, it needs to
be seen from one angle and then work on others. Can you
actually tackle the big topics.
JT: When we look at the broader problems, we do so through a
spatial lens - or we'd never get it finished.
... Describes the how to test section. We want to yo be able to
know if you've followed the BP.
Guest: You're talking about large datasets here. A topic we
looked at in our research was looking at large objects, so at
the entity level.
... Devleopers don't need to detailed accuracy that comes with
some objects.
JT: We have a section talking about larger data, but not large
objects.
issue: Need a BP on handling large spatial objects, i.e. with
lots of detailed geometry. (BP doc)
<trackbot> Created ISSUE-39 - Need a bp on handling large
spatial objects, i.e. with lots of detailed geometry. (bp doc).
Please complete additional details at
<[8]http://www.w3.org/2015/spatial/track/issues/39/edit>.
[8] http://www.w3.org/2015/spatial/track/issues/39/edit
JT: Talks about Expressing spatial data section.
... People use what they use. We want to try and provide clear
guidance on what to do, including if you have a tool chain in
place.
Ed: Draws attention to Issue-225, being too RDF-focussed.
... We're not going to say that the only solution is to use
RDF.
JT: Some people will have heard of JSON-LD, a LD dialect that
lets you get a long way into RDF without actually touching the
Semantic Web baggage.
... Talks about BP6 - which will give us a way to give lost of
differnet examples of how to do stuff.
... There is a spatial thing called 'The American West' - which
doesn't have an agreed boundary.
... Not everything has to have a precise geometry.
... Talks about the importance of CRSs
... When you get to particular application domains, you might
need a different CRS.
... BP9 talks about relative positions.
... 11 Talking about time.
Linda: Sander came to me with a use case that might give us a
new requirement that might be relevant to the spatial
relationships.
... They're trying to integrate data from different
perspectives. One might talk about the properties of the
building itself, others will talk about the representation of
the thing.
Sander steps up to the white board...
Sander: I want to show why it matters that we have these
relationships.
... Gives an example of a building with doors, windows.
... And we have an info-source that might be an XML doc
... Has IDs for the building, the door, the window.
... And there might be a map that shows the polygon that
represents that building.
... This is the kind of data we work with for construction
companies.
... Come say we have a LD approach with classes for the
building, hasPart Door etc.
... Talking about tangible objects
... But others would talk about XML elements with attributes.
... We need a property to say we're talking about the same
thing.
... Something that says I know these things are expressed
differently but they are about the same thing.
... We need to bring these things together in a way that makes
sense and we can't use owl:sameAs
... ArcGIS uses a table to represent things and people will say
that a cell in the table is the same as the thing itself
Ed: In the GIS world, everything is digital, we didn't make the
link to the real world.
Sander: With LD, people see that you can link thing, but if we
want to crawl data and link it, we need to be clear what the
differences are.
Ed: Gives his door lock example
Linda: Not everyine in the WG is convinced that we need to
address this.
Sander: It's not for me to say whether it's for this WG to
tackle.
Guest: Are you talkking to the lock or the wi-fi card in the
lock?
JT: We're beginning on this journey with a plan to say that you
need to identify real world things separately from data about
them.
Guest: What kind of datasets are you working with?
Sander: We have designs, the building process, maintenance -
lots of moments of data needs. For design, we're talking about
CAD. Spatial info we're talking about location of course, this
plot of land etc.
... relations to otehr areas of land.
... And then there are aspects such as validation - whether the
buildings meet regulations etc.
... Do the objects fulfil the needs we extablished. Differnet
people, differnet tooling, differnet priorities.
... These need to be brought together for the owner of hte
road/building who needs all those things.
... We want to be able to say more about all those things.
guest: Why don't you use the standards of the building sector?
== Discussion ensues==
Sander: everyone has their own standards. merging them is
difficult when you want to understand different aspects.
JT: calls time out
... The data integartion one is one we need to include.
... The specific case would be a lunch time discussion.
... Runs through remaining BPs quickly
Sander: Are you linking to other BPs such as self-describing
APIs etc.
Ed: We want to avoid creating anything new if we can avoid it.
JT: Please point them out to us and then we can use them -
don't assume we're going to do the right thing.
... Returns to his initial questions about software
architectures, formats etc.
Guest: There are things that we should try and stop people
doing. Like people using points with a label, but when you zoom
in it all goes wrong.
... People think of spatial data as maps - that's the problem.
... On people's desktop, it works. But instead of having an
annotation it becomes anotehr spatial object.
issue: Making sure that attributes on maps, eg labels, are not
traeted as spatial objects themselves.
<trackbot> Created ISSUE-40 - Making sure that attributes on
maps, eg labels, are not traeted as spatial objects
themselves.. Please complete additional details at
<[9]http://www.w3.org/2015/spatial/track/issues/40/edit>.
[9] http://www.w3.org/2015/spatial/track/issues/40/edit
linda: We need examples and I was wondering whether Edward
could provide such examples for things that change over time.
... is it going to be persietent? Can we link to it?
... We're talking about a Web site that lists all the Dutch
municipalities that exist and those that used to exist.
Guest: The hard thing in general is that publishers don't know
waht their data is being used for, so you need to be flexible
in what you provide.
... Might be intersting to promote things like aggregation,
faceting, to make data more accessible.
... It also relates to the size of geospatial datasets. A list
of all restaurants in NL is long. But if I only want certain
types in certain areas, I don't want all that.
... If we can use these kinds of mechanisms this would help.
JT: That gets us into subsetting.
BartvanLeeuwen: A large part of what you're saying is/should be
covered in DWBP
Guest: Makes the point about the workd 'publish' - gives people
the idea that LD is all about publication. Actually it's about
*storing* data on the Web
... I really don't like the word publish in this context.
... People do a lot of transformation before they publish it as
INSPIRE, rather than actually use INSPIRE.
Dog Food
Ed: WE also need to be aware of the flip case of that. We don't
want to require people to publish all their data.
... It's obviously easier if you use the same IDs inside and
outside the firewall.
issue: Don't rely on the word 'publish' - it's about storting
data on the Web (BP Doc)
<trackbot> Created ISSUE-41 - Don't rely on the word 'publish'
- it's about storting data on the web (bp doc). Please complete
additional details at
<[10]http://www.w3.org/2015/spatial/track/issues/41/edit>.
[10] http://www.w3.org/2015/spatial/track/issues/41/edit
JT: We are keen to have examples of stuff you're doing in the
wild. So thank you - please let us know.
Guest: You're talking about your use cases. We make books -
we're struggling to use Top Braid Composer, linking different
parts of our system.
JT: Talks about the BBC Linked Data Platform.
Linda: What is IGN's interest in LD?
IGN: We have a lot of interest in it for our research
proposals. We've done a lot but it takes time. We're discussing
use of URIs as IDs
... Some people are afraid, whether they have to be
dereferencable or not etc.
... Talking about interlinking. I'll be presenting something on
this later today.
... We are publishing the Administrative units on France as LD,
accessible via a SPARQL endpoint.
JT: So you'd be in a good position to road test the BPs
... is this doc accessible to me? Does it help me?
IGN: Issue of accuracy, merging sets at different levels of
granularity.
Ed: We're not going to solve fundamental problems of spatial
data - we might just raise a warning flag.
JT: Draws the meeting to a close.
Summary of Action Items
Summary of Resolutions
[End of minutes]
__________________________________________________________
Received on Wednesday, 10 February 2016 11:58:58 UTC