[Minutes] 2016-02-10

Many of those involved in the F2F Monday-Tuesday this week are here for 
a third day at the PiLOD/EuroSDR meeting. We held a breakout session 
with potential audience for the BP doc. Both Andrea and I made some 
notes of the occasion, including recording some new issues.

http://www.pilod.nl/wiki/Spatial_Data_on_The_Web_Event_10_February_2016

Notes are online at
https://www.w3.org/2016/02/10-sdw-minutes


Text version below.

       [1] http://www.w3.org/

          F2F Discussion in Amersfoort with EuroSDR, PiLOD, etc.

10 Feb 2016

Attendees

    Present
           Linda, Andrea, Ed, Jeremy, PhilA, ChrisLittle,
           10_guests, BillRoberts, Bart

    Regrets
    Chair
           Jeremy

    Scribe
           phila

Contents

      * [2]Topics
      * [3]Summary of Action Items
      * [4]Summary of Resolutions
      __________________________________________________________

    This is a rough record of the discussion held at the
    [5]EuroSDR/PiLOD meeting in Amersfoort right after F2F 3. It
    was an occasion to meet some of the potential audience for the
    BP doc. It is not an official WG meeting of course but did
    raise some issues that were added to the traker.

       [5] 
http://www.pilod.nl/wiki/Spatial_Data_on_The_Web_Event_10_February_2016

Andrea's Notes

MEETING STARTS

Jeremy: The SDW WG was introduced in the plenary session. We collected s
ome topics that may be relevant for discussion ...
... sw architecures used ...
... which formats, which types ...
... what prevents you from using spatial data in your organisations ...
... we have been trying to structures the BPs around a specific use case
s. If you have real examples to share, please do ...
... tell us what is missing.
Chris: OGC changed the policy about who is the target public, and also n
on-members can influence the process.
Jeremy: Maybe we should do a tour-de-table to know who's here.
Linda: Work at Geonovum, and involved in SDW WG
Ed: Co-chairing the SDW WG, work at Google
??: GIS consultant and specialist. I would like to contribute to the BPs
  by experimenting and testing
??: Want to learn at Linked Data, and know how a book will look like in
the future
??: Focus on publishing data in a way people can understand. Work on one
  of the Geonovum testbed
??: From Norway, work with OGC standard, but also sensor web. Currently
more focusses on WFS. Find it difficult to navigate from a complex XML f
ile by using tools users use.
Chris: Meteorologist at MetOffice. Interest in how to make 4-dimensional
  data
??: Work at Geonovum and very interested in what you're doing
??: PhD student at IGN France, work on linking geo data on the Web
??: From IGN France, very curious on the BPs, especially wrt spatio-temp
oral data, multidimensional data. This approach may bring new opportunit
ies wrt the traditional approach.
??: Web cartographer and developer. Work for Amsterdam municipality, peo
ple knows everything on geo data, but not much on scripting and APIs.
??: Worked on integrating information from different perspective, coveri
ng geo but not only - e.g., CAD. Interested to see how all these "spatia
l" aspects can be brought together.
Phil: From W3C, coordinating the SDW WG.
Jeremy: Work at UK's MetOffice.
Paul van Genuchten: From Geocat, taking care of GeoNetwork, we are integ
rating a lot of webby things. Participating in the testbed.
Jeremy: BP is just one of the deliverables.
Ed: This is actually the 2nd deliverable, but gives the contex to what w
e have been doing. The first one was the UCRs, edited by Frans and Aleja
ndro...
... the other deliverables focussed on specific ontologies - Time, SSN,
Coverages (any data in time and space, very much used in the GIS world).
Jeremy: About the scope...
... three ways of interacting: being a member of the group, drop a mail
to the mailing list, use GH to submit issues ...
... there are 5 themes, reflecting how we think about the problem space.
Ed: One of the challenges we are trying to solve is to identify those BP
s that can be easily implemented, and those that can be considered advan
ced features.
Jeremy: The first problem is the assignment of identifiers. There's some
  discussion on whether identifiers can be assigned to real-world things
or only to information resources. We try to give some advise on how to a
ssign IDs to things...
... but to be actionable on the Web these IDs need to be HTTP URIs.
Jeremy: We are also trying to explain why you have to do it, and which a
re the possible advantages...
... the point is that the BPs must be durable. It is difficult to unders
tand whether or not to give people technical guidance, since in the near
  future these technologies may be out of date
??: Why not putting BPs in a menu, ranked wrt the 5-star criterion.
Jeremy: That's a good idea. We also recognise that people start from dif
ferent positions, so we may need to reflect this in the BPs.
??: Yes, but if the content of the BPs is made somehow visible (e.g., in
  a visual way) this can help people pick what they need.
Jeremy: Another good point.
Jeremy: But do you want concrete guidance, or not?
Ed: We can have both: an abstract description, complemented with an exam
ple on how this can be implemented by using a given technology.
(Many): Agree, concrete examples are needed
??: I recall a wiki having two sections, one that is fixed, and cannot b
e changed, and another one that is kind of working document.
Jeremy: But which platforms the examples should be taken from?
Paul van Genuchten: On of the issues is that people may want to do thing
s, but they are short of fundings. It may be worth addressing the policy
  level, and not only the technical one.
Ed, Jeremy, Phil: agree
[gap]
Jeremy: So, you should provide examples of incremental approaches.
Jeremy: One of the options is about re-using authoritative IDs.
??: There is a possible disadvantange depending on who owns the data.
Jeremy: True, there are situations where you cannot / don't want to re-u
se existing data.
Jeremy: Another option is to re-use your local IDs...
... we give guidance on how to operate persistent IDs ...
... IDs for data slices / subsets ...
Paul van Genuchten: Why the last is not under DWBP? Is there something t
ypically spatial?
Jeremy: This is indeed a general requirement, but the truth is that ther
e's no mechanism defined for other data that can be re-used for spatial
data.
??: We had to tackle with large objects (dataset items).
Jeremy: We have a section dealing with large data, but not on specific t
ypes (e.g., geometry)
Jeremy: Another BP referes to an issue mentioned during the plenary is t
hat there are way too many ontologies, but eventually people don't use t
hem, and prefer JSON/GeoJSON.
Ed: There's an open issue about not being to much focussed on RDF. We ar
e not going to say that RDF is the only solution.
Jeremy: Another BP is about providing the right level of complexity and
the right encoding depending on the use case / application. And in some
cases you don't need to provide a precise geometry (e.g., the boundaris
of the American West).
Jeremy: Another issue is about CRSs. A lot of Web people don't (want to)
  know about CRSs...
... Another point is about relative positions ...
... positional accuracy.
??: [explaining the use case about ability to refer multiple representat
ions to the same resource] ...
... example is a building, described in XML, in a LD way, as a polygon o
n a map a CAD project. People relates them as the same thing, but this i
s not the case, because they are describing the same thing from differen
t perspectives.
Jeremy: [moving on] Related points are make links visible on the Web, ex
press relationship, making data indexable by search engine, link to the
real things and not to their representations.
??: Do you link to other BPs.
Ed: Yes, the principle is not reinvent.
??: We should also prevent geo people from doing some things. People sti
ll thinks at geo data as maps, and, e.g., place textual labels as spatia
l objects themselves.
Jeremy: [issue added]
??: As a data publisher, you don't know who's using your data, for which
  use cases. It might be good to point to solutions that can be used to e
xpose data in a way able to address the most common applications.
Paul van Genuchten: I have a problem with the word "publisher". This may
  give people the wrong idea that LD is only about data publication, wher
eas this should affect also how data are maintained internally (e.g., th
is applies to IDs / URIs).
??: Use case missing [missed it]
Linda: [addressing people from IGN] Do you use LD?
??: Just for research work, at the moment.
??: We are actually publishing the administrative boundaries of France.
It's experimental, currently.
Jeremy: You can then help testing the BPs.
??: Accuracy, vagueness are key issues for us.

END OF MEETING

Phil's Notes

    <scribe> Agenda: Ad Hoc

    scribe; phila

    <scribe> scribe: phila

    Tour de table shows guests are mostly developers looking for
    guidance

    Jeremy: Show the We Need You slide with some questions we're
    looking to answer
    ... Shows the BP doc [6]http://w3c.github.io/sdw/bp/

       [6] http://w3c.github.io/sdw/bp/

    Ed: Gives overview of the other deliverables of the WG

    JT: Goes through the document.
    ... Begins with the audience section - who is this for
    ... Describes the Issue flags etc.
    ... Issues link to GH Issues
    ... Or can write to the mailing list
    ... Describes the 5 themes seen in the BP doc

    Ed: Talks about incremental steps from small to 5* etc.

    JT: Theme 1 - assignment of identifiers
    ... Gets dangerously close to HR-14
    ... So doc tries to give advice on what you should give
    identifiers to.
    ... Describes the structure of a BP
    ... Talks about difference between intended outcome and
    possible approaches to implementation.
    ... Do you want concrete implementation - which may go out of
    date - or do you want "create this outcome" but with less
    guidance.

    Guest: In publication, we make a lot of use of flow charts.
    ... Maybe put in a chart with the different star levels and
    assign BPs to each one.

    JT: That might work as it shows you where to start.
    ... We also have to think about different starting points,
    someone with an SDI cf. a Web Dev

    Guest: When I got the book about Linked Data, I looked at the
    index and decided where I was going to start.
    ... A flow chart would have been helpful.

    JT: Refers to DWBP which has used a visual method for
    navigation.

    Ed: An abstract description cf. more practical

    Guest2: Talks about low hanging fruit and bookmarking pages
    etc. The longer term vision is useful.

    Guest3: I think you need both. You need more proven technology
    section and maybe a future-looking approach.
    ... A wiki in the airline industry where there are very strict
    regulations, and there's a draft section, and the sections are
    clearly delineated with colours etc.

    Guest4: I think you need concrete examples for each BP.

    Ed: We do have a clear internal vision that we want to help
    implementers to grab bits of code and start using it.

    Guest5: If you're talking about code you need to talk about
    platforms etc.

    Ed: Yes, and we recognise that any choice will be out of date
    in time.

    Guest5: There is no one choice of today. JSON(-LD)? works on
    all platforms, it's a data structure, but be careful of
    choosing a development platform.

    JT: If we were to choose a platform...

    phila: W3C/Web Platform etc.

    Guest5: I don't see examples

    JT: Not yet...
    ... I want to come back to choice of platform. Do you want to
    see them in etrms of OGC services, or Python snippets.

    Guest: We have traditional data providers, like governments,
    that we like to tickle to share their data on the Web. Only a
    minority will be tickled.
    ... There's the other groups that need regulation from above.
    ... You dopn't always target the data provider, you target the
    one above the provier to encourage them to push for change.
    ... You may need a differnet documentation approach.

    Ed: You're saying it's more about policy and context?
    ... More the business case?

    Guest: yes.

    JT: Something that a manager can pass on

    Ed: It can be more challenging to make the case for LD to
    policy makers who have spent a lot of money building SDIs.

    Andrea: Another point might be - how can I implement this as a
    layer on top of the existing structure? What are the
    incremental steps? - useful?

    Guest: I say from a tech POV, everything is possible.
    ... I find a layer on top of a layer may not be the best way.

    Andrea: I was thinking of alternatives, that you can start from
    scratch, or slowly add bits.

    Guest: Our experience is that companies like incremental steps
    that show benefits. And LD allows you to start small.
    ... You can guide comapnies towards this practice rather than
    enforcing it from above. benefits win over force.

    Ed: There's a perspective we haven't communicated - beyond the
    SDI world, there is data on the Web that is not explicitly
    spatial that we can make explicitly spatial.
    ... We can say, if you're just building Web pages, how do you
    add a little bit of structured data to make location
    understandable.

    JT: So we want BPs that say if you do this little thing, you'll
    see that benefit.
    ... BP2 is about pointing people to reuse existing
    authoritative identifiers. NL has URIs for dykes, buildings
    etc.
    ... reuse geonames, there are lots of IDs there. Network
    Effect.

    ChrisLittle: We now have a target of no more than 4 links
    (facebook thing)

    JT: There are situations where you can't use somebody else's
    identifier, but where you can, you should.

    Chris: The first star, the licensing side - that's hard. We now
    have 2 categories: open data and open data that is managed -
    which you need to use responsibly.

    JT: We say that you probably have IDs all over the place, but
    they're local ones. How can we help you create URIs from those
    local IDs.
    ... BP4 - changing over time, when tyo mint a new ID.
    ... Guidance on giving identifiers to larger parts of info.
    ... You may only want a smaller subset.
    ... This sparked a lot of debate (see
    [7]https://lists.w3.org/Archives/Public/public-sdw-comments/201
    5Dec/0000.html onwards)

       [7] 
https://lists.w3.org/Archives/Public/public-sdw-comments/2015Dec/0000.html

    Guest: Why is this is the SDW, not DWBP?

    JT: Because it came up looking at large spatial
    datasets/coverages. DWBP didn't come up against it and don't
    have an answer.

    Ed: The heirarchy of places is relevant as well. One place is
    within another. Looking more now at placial, not spatial.

    Guest: This is relevant to such a large audience, it needs to
    be seen from one angle and then work on others. Can you
    actually tackle the big topics.

    JT: When we look at the broader problems, we do so through a
    spatial lens - or we'd never get it finished.
    ... Describes the how to test section. We want to yo be able to
    know if you've followed the BP.

    Guest: You're talking about large datasets here. A topic we
    looked at in our research was looking at large objects, so at
    the entity level.
    ... Devleopers don't need to detailed accuracy that comes with
    some objects.

    JT: We have a section talking about larger data, but not large
    objects.

    issue: Need a BP on handling large spatial objects, i.e. with
    lots of detailed geometry. (BP doc)

    <trackbot> Created ISSUE-39 - Need a bp on handling large
    spatial objects, i.e. with lots of detailed geometry. (bp doc).
    Please complete additional details at
    <[8]http://www.w3.org/2015/spatial/track/issues/39/edit>.

       [8] http://www.w3.org/2015/spatial/track/issues/39/edit

    JT: Talks about Expressing spatial data section.
    ... People use what they use. We want to try and provide clear
    guidance on what to do, including if you have a tool chain in
    place.

    Ed: Draws attention to Issue-225, being too RDF-focussed.
    ... We're not going to say that the only solution is to use
    RDF.

    JT: Some people will have heard of JSON-LD, a LD dialect that
    lets you get a long way into RDF without actually touching the
    Semantic Web baggage.
    ... Talks about BP6 - which will give us a way to give lost of
    differnet examples of how to do stuff.
    ... There is a spatial thing called 'The American West' - which
    doesn't have an agreed boundary.
    ... Not everything has to have a precise geometry.
    ... Talks about the importance of CRSs
    ... When you get to particular application domains, you might
    need a different CRS.
    ... BP9 talks about relative positions.
    ... 11 Talking about time.

    Linda: Sander came to me with a use case that might give us a
    new requirement that might be relevant to the spatial
    relationships.
    ... They're trying to integrate data from different
    perspectives. One might talk about the properties of the
    building itself, others will talk about the representation of
    the thing.

    Sander steps up to the white board...

    Sander: I want to show why it matters that we have these
    relationships.
    ... Gives an example of a building with doors, windows.
    ... And we have an info-source that might be an XML doc
    ... Has IDs for the building, the door, the window.
    ... And there might be a map that shows the polygon that
    represents that building.
    ... This is the kind of data we work with for construction
    companies.
    ... Come say we have a LD approach with classes for the
    building, hasPart Door etc.
    ... Talking about tangible objects
    ... But others would talk about XML elements with attributes.
    ... We need a property to say we're talking about the same
    thing.
    ... Something that says I know these things are expressed
    differently but they are about the same thing.
    ... We need to bring these things together in a way that makes
    sense and we can't use owl:sameAs
    ... ArcGIS uses a table to represent things and people will say
    that a cell in the table is the same as the thing itself

    Ed: In the GIS world, everything is digital, we didn't make the
    link to the real world.

    Sander: With LD, people see that you can link thing, but if we
    want to crawl data and link it, we need to be clear what the
    differences are.

    Ed: Gives his door lock example

    Linda: Not everyine in the WG is convinced that we need to
    address this.

    Sander: It's not for me to say whether it's for this WG to
    tackle.

    Guest: Are you talkking to the lock or the wi-fi card in the
    lock?

    JT: We're beginning on this journey with a plan to say that you
    need to identify real world things separately from data about
    them.

    Guest: What kind of datasets are you working with?

    Sander: We have designs, the building process, maintenance -
    lots of moments of data needs. For design, we're talking about
    CAD. Spatial info we're talking about location of course, this
    plot of land etc.
    ... relations to otehr areas of land.
    ... And then there are aspects such as validation - whether the
    buildings meet regulations etc.
    ... Do the objects fulfil the needs we extablished. Differnet
    people, differnet tooling, differnet priorities.
    ... These need to be brought together for the owner of hte
    road/building who needs all those things.
    ... We want to be able to say more about all those things.

    guest: Why don't you use the standards of the building sector?

    == Discussion ensues==

    Sander: everyone has their own standards. merging them is
    difficult when you want to understand different aspects.

    JT: calls time out
    ... The data integartion one is one we need to include.
    ... The specific case would be a lunch time discussion.
    ... Runs through remaining BPs quickly

    Sander: Are you linking to other BPs such as self-describing
    APIs etc.

    Ed: We want to avoid creating anything new if we can avoid it.

    JT: Please point them out to us and then we can use them -
    don't assume we're going to do the right thing.
    ... Returns to his initial questions about software
    architectures, formats etc.

    Guest: There are things that we should try and stop people
    doing. Like people using points with a label, but when you zoom
    in it all goes wrong.
    ... People think of spatial data as maps - that's the problem.
    ... On people's desktop, it works. But instead of having an
    annotation it becomes anotehr spatial object.

    issue: Making sure that attributes on maps, eg labels, are not
    traeted as spatial objects themselves.

    <trackbot> Created ISSUE-40 - Making sure that attributes on
    maps, eg labels, are not traeted as spatial objects
    themselves.. Please complete additional details at
    <[9]http://www.w3.org/2015/spatial/track/issues/40/edit>.

       [9] http://www.w3.org/2015/spatial/track/issues/40/edit

    linda: We need examples and I was wondering whether Edward
    could provide such examples for things that change over time.
    ... is it going to be persietent? Can we link to it?
    ... We're talking about a Web site that lists all the Dutch
    municipalities that exist and those that used to exist.

    Guest: The hard thing in general is that publishers don't know
    waht their data is being used for, so you need to be flexible
    in what you provide.
    ... Might be intersting to promote things like aggregation,
    faceting, to make data more accessible.
    ... It also relates to the size of geospatial datasets. A list
    of all restaurants in NL is long. But if I only want certain
    types in certain areas, I don't want all that.
    ... If we can use these kinds of mechanisms this would help.

    JT: That gets us into subsetting.

    BartvanLeeuwen: A large part of what you're saying is/should be
    covered in DWBP

    Guest: Makes the point about the workd 'publish' - gives people
    the idea that LD is all about publication. Actually it's about
    *storing* data on the Web
    ... I really don't like the word publish in this context.
    ... People do a lot of transformation before they publish it as
    INSPIRE, rather than actually use INSPIRE.

    Dog Food

    Ed: WE also need to be aware of the flip case of that. We don't
    want to require people to publish all their data.
    ... It's obviously easier if you use the same IDs inside and
    outside the firewall.

    issue: Don't rely on the word 'publish' - it's about storting
    data on the Web (BP Doc)

    <trackbot> Created ISSUE-41 - Don't rely on the word 'publish'
    - it's about storting data on the web (bp doc). Please complete
    additional details at
    <[10]http://www.w3.org/2015/spatial/track/issues/41/edit>.

      [10] http://www.w3.org/2015/spatial/track/issues/41/edit

    JT: We are keen to have examples of stuff you're doing in the
    wild. So thank you - please let us know.

    Guest: You're talking about your use cases. We make books -
    we're struggling to use Top Braid Composer, linking different
    parts of our system.

    JT: Talks about the BBC Linked Data Platform.

    Linda: What is IGN's interest in LD?

    IGN: We have a lot of interest in it for our research
    proposals. We've done a lot but it takes time. We're discussing
    use of URIs as IDs
    ... Some people are afraid, whether they have to be
    dereferencable or not etc.
    ... Talking about interlinking. I'll be presenting something on
    this later today.
    ... We are publishing the Administrative units on France as LD,
    accessible via a SPARQL endpoint.

    JT: So you'd be in a good position to road test the BPs
    ... is this doc accessible to me? Does it help me?

    IGN: Issue of accuracy, merging sets at different levels of
    granularity.

    Ed: We're not going to solve fundamental problems of spatial
    data - we might just raise a warning flag.

    JT: Draws the meeting to a close.

Summary of Action Items

Summary of Resolutions

    [End of minutes]
      __________________________________________________________

Received on Wednesday, 10 February 2016 11:58:58 UTC