Re: "machine readable" from Steven Adler on 2015-04-27 (public-dwbp-wg@w3.org from April 2015)

From: Steven Adler <adler1@us.ibm.com>
Date: Mon, 27 Apr 2015 11:07:28 -0400
To: João Paulo Almeida <jpalmeida@ieee.org>
Cc: Annette Greiner <amgreiner@lbl.gov>, Bernadette Farias Lóscio <bfl@cin.ufpe.br>, Eric Stephan <ericphb@gmail.com>, Ig Ibert Bittencourt <ig.ibert@gmail.com>, João Paulo Almeida <jpandradealmeida@gmail.com>, Phil Archer <phila@w3.org>, Public DWBP WG <public-dwbp-wg@w3.org>, Yaso <yaso@nic.br>
Message-ID: <OF1357FA8A.8E25430C-ON85257E34.0052F3DC-85257E34.005314D8@us.ibm.com>

Since we don't have robots with the ability to "read" data like humans do,
I agree it is the wrong term.  But it is the term everyone uses and
imperfect as it is it is what it is and we have to use it too.


Best Regards,

Steve

Motto: "Do First, Think, Do it Again"


|------------>
| From:      |
|------------>
  >--------------------------------------------------------------------------------------------------------------------------------------------------|
  |JoÃ£o Paulo Almeida <jpalmeida@ieee.org>                                                                                                           |
  >--------------------------------------------------------------------------------------------------------------------------------------------------|
|------------>
| To:        |
|------------>
  >--------------------------------------------------------------------------------------------------------------------------------------------------|
  |Phil Archer <phila@w3.org>, Annette Greiner <amgreiner@lbl.gov>, Eric Stephan <ericphb@gmail.com>, Ig Ibert Bittencourt <ig.ibert@gmail.com>, Yaso|
  |<yaso@nic.br>, Public DWBP WG <public-dwbp-wg@w3.org>, Bernadette Farias LÃ³scio <bfl@cin.ufpe.br>                                                 |
  >--------------------------------------------------------------------------------------------------------------------------------------------------|
|------------>
| Date:      |
|------------>
  >--------------------------------------------------------------------------------------------------------------------------------------------------|
  |04/24/2015 01:23 PM                                                                                                                               |
  >--------------------------------------------------------------------------------------------------------------------------------------------------|
|------------>
| Subject:   |
|------------>
  >--------------------------------------------------------------------------------------------------------------------------------------------------|
  |Re: "machine readable"                                                                                                                            |
  >--------------------------------------------------------------------------------------------------------------------------------------------------|





Dear All,

I too find the qualification Â³machine readableÂ² quite problematic.

I raised this last year in the scope of the UCR document:


> About the whole Â³machine readableÂ² debate, it is of course a different
>story:
> http://www.w3.org/2013/dwbp/track/issues/36


> Similar to Makx, I find it hard to live with the sloppy Â³machine
>readableÂ² qualificationÅ   but, in order to come to a constructive
> suggestion for this other issue as well, perhaps we could say:

> R-FormatMachineRead(able)

> Metadata should conform to standard formats that aim at facilitating
>automated processing



I would prefer avoiding machine readable. Â³StructuredÂ² could help
informally, but is also not precise.

We need to state the quality we want this artifact to have. Perhaps it is
that it is "amenable to automated processing", but this could of course
also be interpreted as vague and too broad.

Makx said "Maybe the requirement is rather that data should be published
in formats that are appropriate for its intended or potential use?Â² This
is the key aspect: making explicit the quality that the artifact needs to
have such that it can be used.

I would say that Â³structureÂ² is needed because we use the structure to
document interpretation rules to establish the semantics of "structured
dataÂ². But that is not enough...

Regards,
JoÃ£o Paulo



On 24/4/15, 1:47 PM, "Phil Archer" <phila@w3.org> wrote:

>Good points, Annette.
>
>I think this is what the 1st and 2nd stars of LOD are getting at.
>
>* Available on the web (whatever format) but with an open licence, to be
>Open Data
>** Available as machine-readable structured data (e.g. excel instead of
>image scan of a table)
>
>Note that here, Excel is included in machine readable. The keyword here
>is structured.
>
>So I think we should focus on the word structured, as you suggest, and
>be very cautious about using the phrase machine readable, perhaps
>avoiding it altogether.
>
>Phil.
>
>On 24/04/2015 15:42, Annette Greiner wrote:
>> Re the definition of machine readable as "Data formats that may be
>>readily parsed by computer programs without access to proprietary
>>libraries. For example, CSV, TSV and RDF formats are machine readable,
>>but PDF and Microsoft Excel are not.Â²
>>
>> I disagree with this definition. All proprietary computer file formats
>>are machine readable. If we want to talk about nonproprietary formats,
>>we should call them nonproprietary formats. If we want to talk about
>>structured data formats, we should call them structured data formats.
>>
>> I just did a search through the BP doc Â² for Â³machine readableÂ², and I
>>think there are two ways it gets used. In the introduction, it is used
>>in the sense of making it easier for machines to parse and do useful
>>things with data. That could be clarified by changing it to Â³more
>>readily machine readableÂ² or some such. Elsewhere, it gets used to mean
>>giving structure to the data. In this latter case, which is the
>>majority, I think we should change it to Â³structuredÂ².
>> -Annette
>> --
>> Annette Greiner
>> NERSC Data and Analytics Services
>> Lawrence Berkeley National Laboratory
>> 510-495-2935
>>
>>
>>
>
>--
>
>
>Phil Archer
>W3C Data Activity Lead
>http://www.w3.org/2013/data/

>
>http://philarcher.org

>+44 (0)7887 767755
>@philarcher1
>

Attachments

image/gif attachment: graycol.gif
image/gif attachment: ecblank.gif

Received on Monday, 27 April 2015 15:10:57 UTC