- From: Tom Baker <tbaker@tbaker.de>
- Date: Sat, 3 Sep 2011 20:09:21 -0400
- To: Karen Coyle <kcoyle@kcoyle.net>
- Cc: public-xg-lld@w3.org
On Sat, Sep 03, 2011 at 01:09:58PM -0700, Karen Coyle wrote:
> Actually, none of the identifiers are coded as "data" (unique
> alphanumeric strings) rather than text. This is an entry for an
> ISBN:
>
> 020 __ $a 0375409726 (lg. print)
>
> The "identifier" is in the same text string as the parenthetical
> phrase. Ditto the LCCN, the OCLC number, and others. So the way it
> is in a MARC record, the ISBN may not be entered as a unique string.
> Only the items in the fixed fields have that characteristic.
> Sometimes there isn't a parenthetical phrase in the 020 field, but
> that's an accident of the particular situation.
>
> 020 __ $a 1844134571
>
> So in fact nearly all of the variable field data in MARC (with a few
> exceptions, but very few) are untyped text strings.
Oy. The text in front of me had been ("BEFORE"):
Most information in library data is encoded as display-oriented text
strings. There are a few shared identifiers for resources, such as ISBNs
for books, but most identification is done with text strings. Some coded
data fields are used in MARC records, but there is not a clear incentive to
include these in all records, since most coded data fields are not used in
library system functions.
Some data fields, such as authority controlled names and subjects, do have
their own associated records in separate files. These records have
identifiers that could be used to represent those entities in library
metadata. However, some of the data formats currently used do not support
the inclusion of these identifiers in existing library records.
Consequently, a number of current library system do not support their use
properly.
...which I edited into ("AFTER"):
Most information in library data is encoded as display-oriented text
strings. Some of the resource identifiers used in library data are based on
unique alphanumeric strings, such as ISBNs for books, but most
identification is done using words and names. Some data fields in MARC
records are coded uniquely, but there is no clear incentive to include
these in all records as few of them are used for library-system functions.
Some data fields, such as authority-controlled names and subjects, have
associated records in separate files, and these records have identifiers
that could be used to represent those entities in library metadata;
however, the data formats in current use do not always support inclusion of
these identifiers in records, so many of today's library systems do not
properly support their use.
I'm not sure I understand the significance of Peter's distinction between
"encoding" and "storing"...?
I'm thinking that the re-write of the second block ("Some data fields...")
looks okay...?
But in light of your examples, Karen, neither "BEFORE" nor "AFTER" seem to get
the point of the first block quite right. Specifically, neither version seems
to capture the point about encoding/storing/expressing ISBNs.
Can you suggest a clarification? The section in question is [1].
Tom
[1] http://www.w3.org/2005/Incubator/lld/wiki/Draft_issues_page_take2#Library_data_is_expressed_primarily_as_text_strings
Received on Sunday, 4 September 2011 00:10:07 UTC