Limiting a Search by URL

First of all, I apologize for starting a discussion and then not getting
back to any of you.  Unfortunately, another project came up and it took the
higher priority.  But I wanted to thank all of you for your thoughts on how
to limit a search by part or all of a URL.  I've gone through all the email
I received and I'll try to pull together what I think I know and what I
know I don't -- I'm hoping ya'll will let me know where I'm off base.

First of all, a recap.  Here's what we want to do:  create a limiter so
that the search term might be "nasa.gov" and we would receive hits that
include both "http://jpl.nasa.gov" and "http://www.nasa.gov/missions.html."

I had a couple of people tell me that their ILS already does this by using
an unanchored phrase search.  I believe the attribute combination for that
would be:
Use: 1032 (Doc-id)
Relation: 3 (equal)
Position: 3 (any position in field)
Structure: 1 (phrase)
Truncation: 100 (Do not truncate)
Completeness: 1 (Incomplete subfield)

So, assuming the search term "nasa.gov," wouldn't you have to use
truncation in order to get either of the results above?  I'm assuming that
"nasa.gov" is a phrase that is being searched.  Would I receive the match
"www.nasa.gov?"

Others said they use the word structure.  This seems to me to be a good
alternative.  As Ralph pointed out, if you searched "nasa.gov," chances
would be pretty high for getting a NASA web site.  Even if I had gotten
some of the sub-domains in the wrong order, I probably still would bring it
up.

I spent a lot of time reading the message from Alan Kent.  I agree with
everything he said, but there is one thing that I didn't make clear.
Although I think most search terms used will be part of the domain name, I
don't want to limit it to that.  Using "http://www.nasa.gov/missions.html,"
I'd want to obtain that page using a search such as "nasa.gov/missions."
Using the word structure above, I could do this.

Please let me know if I've understood the conversation and if I'm missing
anything.  Again, thanks for your help!

Christine Peterson
Library Liaison Officer, Amigos Library Services
14400 Midway Road, Dallas, TX  75244-3509
800/843-8482 x191 (message only)
512/671-1580 (phone and fax)
EMAIL:  peterson@amigos.org

Received on Friday, 24 January 2003 18:29:16 UTC