W3C home > Mailing lists > Public > www-lib@w3.org > October to December 1999

webbot

From: Pauline Field <Pauline@SurfNUSA.net>
Date: Sat, 20 Nov 1999 13:04:10 -0800
Message-ID: <38370CCA.64B5F58C@SurfNUSA.net>
To: www-lib@w3.org
trying to use the sample script robot.sh:

./robot.sh http://www.w3.org/Robot/ http://www.w3.org robot -vp

and I get the following results:

Warning..... Can't open directory
/ArchiveBrowser/|/History/|/member/|/team/
Stat fails.. on "/ArchiveBrowser/|/History/|/member/|/team/" -- giving
up (errno 2)
Load File... Not found - even tried content negotiation
Load End.... Request ended with code -1

Can anyone help me with these errors?

# begin robot.sh
#!/bin/sh
if [ $# -lt 3 ]
then
 echo "A simple example of how the libwww robot can be used"
 echo "For a full description, see"
 echo
 echo "        http://www.w3.org/Robot/User/CommandLine.html"
 echo
 echo "Usage: $0 RootURI ImageRootURI LogPrefix [ flags ]"
 echo
        echo "where"
 echo "        RootURI       is the URI prefix for links, for example
http://www.w3.org/Robot/"
 echo "        ImageRootURI  is the URI prefix for inlined images, for
example http://www.w3.org"
 echo "        LogPrefix     is the prefix for log files, for example
robot"
 echo "        flags         are any additional command line flags, for
example -vp"
 echo
 echo "See"
 echo "        http://www.w3.org/Robot/"
 echo
 echo "for more information"
 exit 1
fi

ROOT=$1
IMGROOT=$2
LOG=$3
FLAGS=$4

ROBOT=webbot

${ROBOT} ${FLAGS} -q -ss -n -depth 99 \
-exclude "/ArchiveBrowser/|/History/|/member/|/team/" \
-check
"\.gz$|\.tar$|\.tgz$|\.Z$|\.zip$|\.ZIP$|\.exe$|\.EXE$|\.ps$|\.doc$|\.pdf$|\.xplot$|\.java$|\.c$|\.h$|\.txt$|\.ppt$|\.gif$|\.GIF$|\.tiff$|\.png$|\.PNG$|\.jpeg$|\.jpg$|\.JPE$"
\
-prefix ${ROOT} \
-img -imgprefix ${IMGROOT} \
-l ${LOG}-log-clf.txt \
-alt ${LOG}-log-alt.txt \
-hit ${LOG}-log-hit.txt \
-rellog ${LOG}-log-link-relations.txt -relation stylesheet \
-lm ${LOG}-log-lastmodified.txt \
-title ${LOG}-log-title.txt \
-referer ${LOG}-log-referer.txt \
-negotiated ${LOG}-log-negotiated.txt \
-404 ${LOG}-log-notfound.txt \
-reject ${LOG}-log-reject.txt \
-format ${LOG}-log-format.txt \
-charset ${LOG}-log-charset.txt \
${ROOT}
# end robot.sh

--
Pauline Field
SurfNDevelopment Corporation
(423) 821-3463, Fax (423) 821-3446
3821 St Elmo Ave, 2nd Floor, Chattanooga, TN 37409
Received on Saturday, 20 November 1999 13:08:16 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 23 April 2007 18:18:35 GMT