Re: Defining "Open" Data (was RE: no F2F3 in 2009 -- Re: Agenda, eGov IG Call, 11 Nov 2009)

As Todd points out several forms of availability and openess exist. The 
factual openess depends both on intended usage and on how the data 
is/can be provided. The latter introduces potential problems regarding 
bandwith,  update frequencies etc.  To be short the factual problems of 
(re)using sources  may be quite complex. Open source is usually used in 
relation to open-source software, and this at least gives you the 
possibility to inspect the code (minimum position). Extending the 
concept open to any data source introduces all the issues mentioned.


Cheers,

Tom

Prof. dr. Tom M. van Engers
Professor in Legal Knowledge Management
University of Amsterdam/Faculty of Law
Leibniz Center for Law
Kloveniersburgwal 48
Postbus 1030
1000BA Amsterdam
+31 20 525 3494
+31 20 525 2179
www.LeibnizCenter.org
vanEngers@uva.nl



Anne L. Washington wrote:
> Amen to Todd's definitions.
>
> This level of differentiation is especially crucial for our group to 
> grasp when dealing with public officials. The public sector may feel 
> they have made the data "open" by putting pdfs on the web. They don't 
> understand the subtle arguments we make about open data.
>
> They are thinking printer-friendly and
> we are talking processing-friendly.
>
> Maybe we could publish a quantitative score for measuring where 
> projects stand on the ladder towards total "open data"?
>
> It is the type of project that make sense to emerge from a standards 
> group. We could use this type of rubric as a scale to score existing 
> projects or our suggested improvements for future ones. It would help 
> public officials know where they are versus where they could be. It 
> could be part of the the first group that John Sheridan suggested >>> 
> - Government Linked Data, Techniques and Technologies. I have more 
> ideas that we could flush out if there is any interest.
>
>
> Anne L. Washington
> Standards work - W3C egov - washingtona@acm.org
> Academic work - George Washington University http://home.gwu.edu/~annew/
>
>
> On Thu, 12 Nov 2009, Todd Vincent wrote:
>
>> This is an interesting topic.  I find that the term "Open" is used 
>> many, many different ways and often to the advantage of the person 
>> using the term.  Vendors will use the term "open" to promote the 
>> technology and data they sell.  Likewise, anti-vendors will use the 
>> word "open" to claim that a given technology or data is not 
>> sufficiently open, therefore, should not be used or is less worthy in 
>> some way.  The term "open" is overused and has lost much of its 
>> meaning over time as a result.  In the end, someone has to pay to 
>> make data available, which is generally why there is some cost 
>> recovery mechanism associated with end-use.  Nothing in this world is 
>> free and often people associate "open" with "free."
>>
>> That said, I wonder whether a matrix of logical possibilities 
>> wouldn't be helpful, after which we might document and standardize 
>> descriptive names.  Here is a stab at it:
>>
>>
>> Unavailable: You simply cannot get the data.  Data is cost 
>> prohibitive to publish. There may be security or privacy reasons not 
>> to publish.  Or, simply, no one ever thought to publish the data.
>>
>> Not Translated: Data is available, but exists in a different language 
>> than the end user's language.
>>
>> Paper: Data is available, but it is only available on paper.
>>
>> Free: Data is available at no cost and without restrictions.
>>
>> Fee Based: Data is available, but only for a fee.
>> --- Public: Fee Based: Government provides data for a fee.
>> --- Private: Fee Based: Private company provides data for a fee.
>>
>> Copyright: Data is available (in some way) but there are copyright 
>> restrictions on republication or reuse.
>>
>> Copyright with License: Data is available (in some way), there is a 
>> copyright, but also a license that allows some use (other than all 
>> rights reserved).
>>
>> Public Domain: Data is available (in some way) and is in the public 
>> domain, so there are no restrictions on use of the data.
>>
>> Electronic: Data is available electronically.
>>
>> Electronic: Web Browser or Paper-Like Electronic Document Format: 
>> Data is available but only via a web browser or an electronic 
>> document format and not in an easily parsed format (where 
>> Images/Graphics, HTML, XHTML, PDF, Word, and Word Perfect do not 
>> count as easily parsed formats).
>>
>> Electronic: Structured Format: Data is available electronically and 
>> in a structured format.  A structured format would include delimited 
>> text, spreadsheet, XML, and the like.
>>
>> Electronic: Structured Format: Schema: Data is available 
>> electronically and in a structured format.  Additionally, there is a 
>> schema available that defines the structured format.
>>
>> --- Government Schema: A government promulgates the schema. The 
>> schema may or may not be in the public domain.
>>
>> --- Standards Body Schema: A recognized standards body promulgates 
>> the schema.  Schema is licensed under a "copyleft" (perpetual, free, 
>> but with restrictions not to modify) or similar license (typical of 
>> W3C, OASIS, but not all "recognized" standards bodies).
>>
>> --- Private Schema:  A private company promulgates the schema.  The 
>> schema may or may not have licensing restrictions associated with it.
>>
>> Electronic: Browser/Viewer: Electronic data, whether structured or 
>> not, is available only via a web browser or other viewer for viewing.
>>
>> Electronic: Download: Electronic data, whether structured or not, is 
>> available to download.  Here, download means a "manual" download. 
>> Some manual user input must be done to download the data (e.g., 
>> downloading a spreadsheet or structured text file via an HTTP link or 
>> FTP) to the user's local machine.
>>
>> Electronic: Web Service: Electronic data, typically structured, is 
>> available via a web service (meant in a generic way, not specific to 
>> a technology) for machine consumption.  There is some standard, 
>> specification, or documented publication rules, such that machines 
>> can reliably access the data on an ongoing basis.  The point here is 
>> not the format of the data, but the reliability and availability of 
>> the connection to the data, so that machines can get to the data feed 
>> without human intervention.
>>
>>
>> Each of these qualities makes the data more or less "open" or 
>> "accessible" as a practical matter.  There are  many combinations of 
>> these that one could put together.
>>
>>
>>  
>> Thanks,
>>  
>> Todd
>> ===========================
>> Winchel "Todd" Vincent III
>> <xmlLegal> http://www.xmllegal.org/
>> Phone : 404.822.4668
>> Fax     : 770.216.1633
>> Email : Todd.Vincent@xmllegal.org
>>  
>> This message including any attachments and links to outside resources 
>> contains confidential information intended for a specific individual 
>> and purpose and is protected by law.  If you are not the intended 
>> recipient, please delete this message.  Disclosing, copying, or 
>> distributing this message, or the taking of any action based on it, 
>> is prohibited without permission.
>>
>> -----Original Message-----
>> From: public-egov-ig-request@w3.org 
>> [mailto:public-egov-ig-request@w3.org] On Behalf Of Niklas Lindström
>> Sent: Thursday, November 12, 2009 11:23 AM
>> To: prof. dr. Tom M. van Engers
>> Cc: peter.krantz@gmail.com; david osimo; Jose Manuel Alonso; eGovIG IG
>> Subject: Re: no F2F3 in 2009 -- Re: Agenda, eGov IG Call, 11 Nov 2009
>>
>> Hi,
>>
>> how about Open == Readable?
>>
>> Not necessarily writable of course, nor republishable. But if data is
>> inaccessible until someone buys it (or "rents" it), can it sensibly be
>> called open?
>>
>> The situation (at least in Sweden) is that lots of government/agency
>> data is totally unavailable until paid for. That is, if it at all
>> exists in some digital form. (And for that matter, is processable;
>> i.e. exists in an open format for which there are patent-free parser
>> implementations...)
>>
>> Best regards,
>> Niklas Lindström
>>
>>
>> On Thu, Nov 12, 2009 at 3:57 PM, prof. dr. Tom M. van Engers
>> <vanengers@uva.nl> wrote:
>>> Open <> Free
>>>
>>> Prof.dr. Tom M. van Engers
>>> Professor in Legal Knowledge Management
>>> University of Amsterdam/Faculty of Law
>>> Leibniz Center for Law
>>> Kloveniersburgwal 48
>>> Postbus 1030
>>> 1000 BA Amsterdam
>>> +31 20 525 3494
>>> +31 20 525 2179 (fax)
>>> www.LeibnizCenter.org
>>> vanEngers@uva.nl
>>>
>>> -----Original Message-----
>>>
>>> From: Peter Krantz <peter.krantz@gmail.com>
>>> Sent: Thu, 12 Nov 2009 12:29:01 +0100
>>> To: david osimo <david.osimo@gmail.com>
>>> CC: Jose Manuel Alonso <josema.alonso@fundacionctic.org>, eGovIG IG 
>>> <public-egov-ig@w3.org>
>>> Received:  Thu,  12 Nov 2009 03:29:01 -0800 (PST)
>>> Subject: Re: no F2F3 in 2009 -- Re: Agenda, eGov IG Call, 11 Nov 2009
>>>
>>> Hi!
>>>
>>> Open data is gaining a lot of interest in the EU right now. People are
>>> discovering that e-government services may have to be cerated by
>>> citizens as well and that a key enabler for this is access to open
>>> data (the other two being a laptop and programming knowledge).
>>>
>>> The current PSI directive (that member states should implement i
>>> national legislation) is rather vague and I hope to see a clerar
>>> definition of "open" data in a forthcoming version. Many agencies in
>>> Sweden provide data, but at a cost. You would have to pay USD 20K to
>>> get a basic digital map. This is why you don't see a FixMyStreet
>>> service here.
>>>
>>> Regards,
>>>
>>> Peter Krantz
>>> http://www.opengov.se/
>>>
>>>
>>> On Thu, Nov 12, 2009 at 10:37, david osimo <david.osimo@gmail.com> 
>>> wrote:
>>>> Jose, all
>>>> I will be in Malmo and happy to debrief. It looks like open data 
>>>> will be a
>>>> key focus, not only in our Open Declaration but also in the official
>>>> statements.
>>>> The recent Visby conference on EU ICT policy gave good priority to 
>>>> open
>>>> data. Hans Rosling of Gapminder was keynote speaker.
>>>> http://www.se2009.eu/en/meetings_news/2009/11/9/visby_agenda_creating_impact_for_an_eunion_2015 
>>>>
>>>> The conclusions of the presidency (the most important document) 
>>>> state that
>>>> 10. EU member states and community institutions should seek to make 
>>>> data
>>>> freely accessible in open machine-readable formats, for the benefit of
>>>> entrepreneurship, research and transparency.
>>>> 11. Access to and reuse of public sector information and data 
>>>> should be
>>>> improved among EU Member States. The domains of data targeted by the
>>>> Directive on the re-use of public sector information should be 
>>>> enlarged.
>>>> Document attached
>>>> Best
>>>> david
>>>>
>>>>> This is our e-government EU strategy, let's make it better! Please 
>>>>> endorse
>>>>> the open EU declaration on public services
>>>>> 2.0 www.endorsetheopendeclaration.eu
>>>> david.osimo@gmail.com
>>>> skype, twitter: osimod
>>>> http://egov20.wordpress.com
>>>> mobile: +32-498088323
>>>> On 11 Nov 2009, at 13:10, Jose Manuel Alonso wrote:
>>>>
>>>> Additionally, I should have mentioned that the plans to hold an 
>>>> event in
>>>> Brussels in Nov/Dec as the group's F2F3 have not been successful 
>>>> due to
>>>> workload on our end and on the European Commission's. Our ideas were
>>>> welcomed and we expect to retake conversation soon, likely after 
>>>> the EU
>>>> Ministerial in Malmö <http://www.egov2009.se/> next week. I expect 
>>>> to give a
>>>> more detailed update on the next call.
>>>>
>>>> -- Jose
>>>>
>>>> ps: among other things, I also had to cancel my trip to Malmö. If 
>>>> any of you
>>>> are able to attend, I hope you could debrief us on an upcoming 
>>>> group call..
>>>>
>>>>
>>>>
>>>> El 10/11/2009, a las 9:34, Sheridan, John escribió:
>>>>
>>>> Agenda, eGov IG Call, 11 Nov 2009
>>>>
>>>> 13:00Z (9:00EST, 14:00GMT, 15:00CET)
>>>>
>>>> ------------------------------------------------------------------------------------------ 
>>>>
>>>>
>>>> Chair: John
>>>>
>>>> 1. Scribe
>>>>
>>>> 2. Agenda adjustments and next meetings [2mins]
>>>>
>>>> + next meeting [25 Nov 2009]
>>>>
>>>> + next scribe
>>>>
>>>> 3. Open Actions [10min]
>>>>
>>>> 4. What's going on / coming up [30mins]
>>>>
>>>> + TPAC 2009
>>>>
>>>> + ISWC
>>>>
>>>> + Open Knowledge Foundation event [13 Nov 2009, London]
>>>>
>>>> + Jurix Conference
>>>>
>>>> 5. Discussion: Government Linked Data, Techniques and Technologies 
>>>> [45mins]
>>>>
>>>> + how does linked data support (non-RDF) data consumers?
>>>>
>>>> + strategies for modelling government data
>>>>
>>>> + essential metadata for Government Linked Open Data (eg VoiD)
>>>>
>>>> + expressing rights and licensing information
>>>>
>>>> + approaches to provenance, authority and trust
>>>>
>>>> + using RDF for Statistical Data
>>>>
>>>>
>>>> Who can participate:
>>>>
>>>> http://www.w3.org/2007/eGov/IG/participation
>>>>
>>>> How to participate:
>>>>
>>>> http://www.w3.org/2007/eGov/IG/wiki/Teleconferences
>>>>
>>>> Notes,
>>>>
>>>> On the last group call five areas were discussed, as being the main 
>>>> work
>>>> areas for the group over this year:
>>>>
>>>> - Government Linked Data, Techniques and Technologies
>>>>
>>>> - Government Linked Data, Strategies and Success Stories
>>>>
>>>> - Best Practices for using Web Technologies to Deliver Government 
>>>> Services
>>>>
>>>> - Best Practices for Long-Term Government Data Management Issues
>>>>
>>>> - Best Practices in Government Use of Social Media
>>>>
>>>> We agreed we would pick up on one each of these over the next set 
>>>> of group
>>>> calls. This call will be addressing the first of these topics, 
>>>> "Techniques
>>>> and Technologies for Government Linked Data".
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> Please don't print this e-mail unless you really need to.
>>>>
>>>> --------------------------------------------------------------------------------- 
>>>>
>>>>
>>>> National Archives Disclaimer
>>>>
>>>> This email message (and attachments) may contain information that is
>>>> confidential to The National Archives. If you are not the intended 
>>>> recipient
>>>> you cannot use, distribute or copy the message
>>>>
>>>> or attachments. In such a case, please notify the sender by return 
>>>> email
>>>> immediately and erase all copies of the message and attachments. 
>>>> Opinions,
>>>> conclusions and other information in this message
>>>>
>>>> and attachments that do not relate to the official business of The 
>>>> National
>>>> Archives are neither given nor endorsed by it.
>>>>
>>>> ------------------------------------------------------------------------------------ 
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>
>>
>>
>>

Received on Friday, 13 November 2009 11:01:34 UTC