RE: Limiting a search by URL

I believe has Dana's description exactly matches the VTLS Z39.50
implementation for searching URLs. All of this is user configurable in the
VTLS system. Thus VTLS users can pick a set of tags (any or all) like 856/u,
505/u, 514/u, 520/u, 530/u, 545/u, 552/u, 555/u and 583/u and have then
indexed as the URL search tags. Users can also pick a use attribute to
assign to this index. Proximity searches and truncation is also supported. I
think that standardizing on a use attribute (should it be 1032 as suggested
by Chris or 5856 as used by OCLC or something else?) for URL searches would
help. One could always search on "anywhere" and get reasonably good results
for URLs but we would prefer the use of a standardized use attribute for
this purpose.

Vinod Chachra
President, VTLS Inc.
1701 Kraft Drive
Blacksburg, VA  24060

Phone: (540) 557-1200
Fax: (540) 557-1210


-----Original Message-----
From: www-zig-request@w3.org [mailto:www-zig-request@w3.org]On Behalf Of
Dietz,Dana
Sent: Friday, January 03, 2003 10:50 AM
To: www-zig@w3.org
Subject: RE: Limiting a search by URL



OCLC FirstSearch allows searching on URLs through a keyword search.  The URL
is broken into "words" at punctuation marks so that
http://www.oclc.org/firstsearch/journal=1002-3456;screen=basic could be
found by searching any of the following words alone or in combination: www
oclc org firstsearch journal 1002 3456 screen basic.  Use attribute 5856 is
used for this search; the following fields are searched in this index:
856/u, 505/u, 514/u, 520/u, 530/u, 545/u, 552/u, 555/u and 583/u.

This is what we're currently doing but we're interested in new methods if
anyone's got suggestions.

Dana

Dana Dietz
WorldCat Services -- Product Support
Online Computer Library Center, Inc.
6565 Frantz Road, MC 736
Dublin, OH 43017
dietzd@oclc.org  |  800-848-5878 ext. 5064


-----Original Message-----
From: Chris Peterson/Amigos [mailto:peterson@amigos.org]
Sent: Thursday, January 02, 2003 5:38 PM
To: www-zig@w3.org
Subject: Limiting a search by URL



The Texas Z39.50 Implementors Group is working on a profile that provides
the ability to limit by URL.  That is, we would like to be able to limit by
"nasa.gov" and receive hits that include "jpl.nasa.gov" as well as
"nasa.gov/missions.html."

Here is how we have defined the search right now:

Use -- 1032 -- doc-id
Relation -- 3 -- equal
Position -- 3 -- any position in field
Structure -- 108 -- string
Truncation -- 3 -- left and right truncation
Completeness -- 1 -- incomplete subfield

Ray suggested that structure 108 might be problemmatic.  Are there better
ways to do this?  All comments are welcome.

Christine Peterson
Library Liaison Officer, Amigos Library Services
14400 Midway Road, Dallas, TX  75244-3509
800/843-8482 x191 (message only)
512/671-1580 (phone and fax)
EMAIL:  peterson@amigos.org

Received on Friday, 3 January 2003 14:24:48 UTC