- From: Mercurial notifier <nobody@w3.org>
- Date: Thu, 05 Aug 2010 14:46:52 +0000
- To: link-checker updates <www-validator-cvs@w3.org>
changeset: 2:ddeae812731c
tag: import-from-validator
user: ville
date: Thu Mar 18 21:44:14 2004 +0000
files: ChangeLog ChangeLog.old MANIFEST META.yml Makefile.PL README bin/checklink.pod
description:
First CPANified version, copied here from the validator dir.
diff -r 55210bc792ab -r ddeae812731c ChangeLog
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/ChangeLog Thu Mar 18 21:44:14 2004 +0000
@@ -0,0 +1,3 @@
+2004-02-18 Ville Skyttä <ville.skytta@iki.fi>
+
+ * First CPAN version.
diff -r 55210bc792ab -r ddeae812731c ChangeLog.old
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/ChangeLog.old Thu Mar 18 21:44:14 2004 +0000
@@ -0,0 +1,856 @@
+2004-02-01 13:07 ville
+
+ * checklink.pl: 2004.
+
+2003-11-25 22:46 ville
+
+ * checklink.pl: Markup improvements: "fussy"-clean, towards HTML
+ 4.01 Strict and ISO-HTML.
+
+2003-11-24 21:11 ville
+
+ * checklink.pl: Fix HTML "escaping" of URIs. Some were escaped
+ twice, some not at all.
+
+2003-11-22 17:40 ville
+
+ * checklink.pl: Check recursion scope also for redirected
+ documents.
+ http://lists.w3.org/Archives/Public/www-validator/2003Nov/0074.html
+
+2003-11-22 17:20 ville
+
+ * checklink.pl: Improve "To do" message wrapping in command line
+ mode.
+
+2003-11-22 17:17 ville
+
+ * checklink.pl: Fix URI->abs() usage in absolutizing URIs.
+
+2003-10-20 23:19 ville
+
+ * checklink.pl: Make ip_allowed() not choke on URIs that don't
+ implement host().
+ http://lists.w3.org/Archives/Public/www-validator/2003Oct/0056.html
+
+2003-09-15 21:31 ville
+
+ * checklink.pl: Unbreak --masquerade, enhance documentation and
+ error messages. W3C Bugzilla #340, Debian bug #187791, thanks to
+ Dan Jacobson and Frederic Schutz for the report.
+
+2003-09-12 20:49 ville
+
+ * checklink.pl: Make the configuration file optional. Thanks to
+ Hugo Haas for the heads up. Also, load Config::General only if
+ needed.
+
+2003-09-11 13:46 hugo
+
+ * checklink.pl: Fixing the following: [Thu Sep 11 12:44:53 2003]
+ checklink.pl: Argument "help" isn't numeric in exit at
+ /home/hugo/bin/checklink.pl line 410.
+
+2003-07-27 19:19 ville
+
+ * checklink.pl: Don't conditionalize the Term::ReadKey requirement,
+ but defer its loading until actually needed. system('stty')
+ doesn't work under -T without laundering $ENV{PATH}, and I don't
+ feel like going there :P
+
+ Also, don't load CGI and CGI::Carp in command line mode; defer by
+ using require() instead of use().
+
+2003-07-27 18:38 ville
+
+ * checklink.pl: Implement option to disallow checking non-public IP
+ addresses; disallow by default in CGI mode and allow in command
+ line mode.
+
+ New requirement: Net::IP.
+
+2003-07-26 22:20 ville
+
+ * checklink.pl: Use Text::Wrap for beautifying command line output,
+ some other output tweaks.
+
+2003-07-26 22:16 ville
+
+ * checklink.pl: Fix redirect loop detection, and add a descriptive
+ error message.
+
+ Previously, only loops like L1 -> L2 -> L3 -> L1 were detected;
+ ones like L1 -> L2 -> L3 -> L2 resulted in infinite recursion.
+
+ Fixes <http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=188372>,
+ thanks to Dan Jacobson and Frederic Schutz for reporting this.
+
+2003-07-26 18:48 ville
+
+ * checklink.pl: Plug myself into the man page's AUTHOR section, per
+ Frederic's request :)
+
+2003-07-23 23:08 ville
+
+ * checklink.pl: Include POD documentation, courtesy of Frederic
+ Schutz.
+
+2003-06-16 00:41 ville
+
+ * checklink.pl: Document env vars affecting FTP configuration, and
+ make passive mode the default (#226).
+
+2003-06-15 17:56 ville
+
+ * checklink.pl: Trust and launder the NNTPSERVER and NEWSHOST
+ environment variables in order to support news: URIs without
+ explicit hostnames under -T.
+
+2003-06-15 17:42 ville
+
+ * checklink.pl: Call get_timestamp always without arguments.
+
+2003-06-02 20:27 ville
+
+ * checklink.pl: -? == --help, be a bit more verbose with the usage
+ of --location and --recursive (Debian bug report #189642).
+
+2003-05-24 23:32 link
+
+ * checklink.pl: Merging from validator-0_6_0-branch, at tag
+ validator-0_6_2-beta1, to HEAD.
+
+2003-04-19 22:50 ville
+
+ * checklink.pl: - Abort check in case of non-understood command
+ line options (#203, Debian bug #187792). - Allow bundling of
+ command line options. - Bump 2002->2003 in copyright.
+
+2003-02-05 23:47 ville
+
+ * checklink.pl: Skip 'base' tags with empty hrefs (#138). Put HTML
+ validity a bit more prominently into beginning of results page.
+
+2003-02-02 21:03 ville
+
+ * checklink.pl: - Implemented external configuration file stuff
+ (#127). We now require Config::General >= 2.06. - Fixed basic
+ authentication (#122). If the trusted domain is not specificed,
+ we only send the authentication to the same host which requested
+ it. Also added a notice about this to the error page/command
+ line message if no suitable authentication info was given. -
+ Changed text/javascript to application/x-javascript, thanks to
+ Nicolas Roeser for the report. (see
+ <http://lists.w3.org/Archives/Public/www-validator/2003Jan/0158.html>)
+ - Fixed missing results from the summary when a file:// URI was
+ present (also reduces number of error log warnings emitted). -
+ Command line --help clarifications, reformatting. - Code
+ cleanups.
+
+2003-01-07 22:07 ville
+
+ * checklink.pl: Fix markup in "Authorization Required" error
+ document.
+
+2002-12-09 02:16 ville
+
+ * checklink.pl: - Fix multiple brainos in cookie code.
+
+2002-12-08 16:47 ville
+
+ * checklink.pl: - Rewrote recursion scope checking, was failing
+ with URIs with no slash after the hostname; eg.
+ <http://validator.w3.org> caused checklink to wander to other
+ sites (#115). - Report CVS revision as version number.
+
+2002-12-07 21:55 ville
+
+ * checklink.pl: - Send Accept: application/xhtml+xml, text/html,
+ */*;q=0.5 (#112) - Sanitize Accept-Language headers; either send
+ the ones from a browser or *. In command line mode, really
+ default to * and send the ones given with the command line
+ option. (#110) - Add messages for codes 303 (#111), 307 and 406
+ (#55). - Add "for" attribute for all <label>'s and the
+ corresponding id's so that MSIE users can enjoy clicking the
+ labels. - Some internal cleanup.
+
+2002-11-23 23:37 ville
+
+ * checklink.pl: - Options can be saved in a cookie (#56). - Added
+ a JavaScript check against submitting empty URIs. - Style
+ tweaks, ala Validator. - Point to new docs location (relative).
+
+2002-11-23 17:06 ville
+
+ * checklink.pl: - Tweak "form page" layout. - Don't say "Valid
+ anchors!" if no anchors found. - Fix HTML output from command
+ line.
+
+2002-11-22 01:27 ville
+
+ * checklink.pl: Output and style nits.
+
+2002-10-27 15:04 ville
+
+ * checklink.pl: Removed the debug logic and reworked the guess
+ whether the script is being run as a CGI or from the command
+ line; the problems this caused outweigh its usefulness [1]. Oh,
+ and this fixes #26.
+
+ [1] <http://nagoya.apache.org/bugzilla/show_bug.cgi?id=13914>
+
+2002-10-27 14:08 ville
+
+ * checklink.pl: Unbreak the source download link (introduced in my
+ previous commit). Found by checking the output of checklink with
+ itself. Yay :)
+
+2002-10-27 12:55 ville
+
+ * checklink.pl: - Load Getopt::Long only in command line mode. -
+ Tighter lexical scoping for variables. - Output version/revision
+ number consistently everywhere.
+
+2002-10-27 01:43 ville
+
+ * checklink.pl: Use keep-alive if the installed LWP is >= 5.6.
+
+2002-10-27 00:49 ville
+
+ * checklink.pl: - Made W3C::UserAgent more OO avoiding use of
+ W3C::CheckLink's lexicals. - Tuned HTTP header sending to make
+ mod_perl 1.99_05 happy. - Some style tweaks; reorganized
+ use()'s, added missing ones.
+
+2002-10-26 22:19 ville
+
+ * checklink.pl: Swap $REVISION and $VERSION in order to be more
+ consistent with Perl stuff in general, make sure that we have at
+ least two digits in the version number.
+
+2002-10-26 22:04 ville
+
+ * checklink.pl: Whitespace only (reindentation).
+
+2002-10-26 21:29 ville
+
+ * checklink.pl: Portability update, now actually works on Win2K.
+
+ Term::ReadKey is now used for reading the password from the
+ command line if it is available, falling back to stty if it
+ isn't.
+
+2002-10-23 23:31 ville
+
+ * checklink.pl: Fix all function prototypes, and remove them from
+ methods.
+
+2002-10-23 07:32 hugo
+
+ * checklink.pl: Security fix: cross-scripting vulnerability found
+ and fixed by ITO Tsuyoshi <tsuyoshi@is.s.u-tokyo.ac.jp>.
+
+2002-08-22 04:03 link
+
+ * checklink.pl: Applying checklink-options.patch from Ville SKytta.
+ Thanks Ville! Use GetOpt::Long for command-line option handling.
+ Add &version/-V.
+
+2002-08-22 03:56 link
+
+ * checklink.pl: Applying checklink-cleanup.patch from Ville SKytta.
+ Thanks Ville! (A bunch of cleanups; police whitespace, namespace,
+ HTML, etc.)
+
+2002-08-20 04:51 link
+
+ * checklink.pl: Merging to HEAD from validator-0_6_0 branch. This
+ will be validator-0_6_0b1c1 (Beta 1, Candidate 1).
+
+2002-07-08 00:54 link
+
+ * checklink.pl: Apply "Local Validator" patch from Ville Skytta.
+ Thanks Ville!
+
+2002-06-21 23:29 eric
+
+ * checklink.pl: s/netloc/host/ to reflect API change. made it easy
+ to rerun HTTP sessions in the debugger. covering for that
+ slacker Hugo.
+
+2002-02-01 23:29 hugo
+
+ * checklink.pl: Updated copyright info
+
+2002-01-11 14:36 hugo
+
+ * checklink.pl: Added support for application/xhtml+xml documents
+
+2001-11-26 20:18 hugo
+
+ * checklink.pl: Applied patch from Henk P. Penning
+ <henkp@cs.uu.nl>: Using Time::HiRes instead of sys/syscall.ph
+ (should be more portable)
+
+ See:
+ http://lists.w3.org/Archives/Public/www-validator/2001OctDec/0224.html
+
+2001-09-21 17:15 hugo
+
+ * checklink.pl: Tiny change: escaping less characters in URIs (0-9
+ and . are no longer escaped)
+
+2001-09-17 23:23 hugo
+
+ * checklink.pl: Fixed a URI escaping problem
+
+2001-08-01 14:53 hugo
+
+ * checklink.pl: Added support for *_proxy environment variables
+
+2001-07-25 14:34 hugo
+
+ * checklink.pl: Specifying the charset (Latin-1)
+
+2001-06-27 18:59 hugo
+
+ * checklink.pl: Apllied patch from Terje Bless <link@tss.no>:
+ http://lists.w3.org/Archives/Public/www-validator/2001AprJun/0496.html
+
+2001-06-05 13:09 gerald
+
+ * checklink.pl: typo
+
+2001-04-04 22:18 hugo
+
+ * checklink.pl: Ooops, left a closing tag...
+
+2001-04-04 22:16 hugo
+
+ * checklink.pl: Fixed an HTML validity problem in the summary only
+ mode:
+
+ http://lists.w3.org/Archives/Public/www-validator/2001AprJun/0001.html
+
+2001-03-27 15:58 hugo
+
+ * checklink.pl: Added directions for inaccessible javascript: URIs
+
+2001-03-27 02:12 hugo
+
+ * checklink.pl: Got rid of $_contact_address Changed the mailto:
+ URIs and instructions to include 'checklink' in the subject when
+ sending a mail to www-validator
+
+2001-03-27 01:46 hugo
+
+ * checklink.pl: Made the "Summary only" function do the same as
+ John <rowe@excc.ex.ac.uk>'s quiet function from:
+ http://lists.w3.org/Archives/Public/www-validator/2001JanMar/0261.html
+ This is a tiny change.
+
+2001-03-27 01:33 hugo
+
+ * checklink.pl: Added a --depth option based on the patch provided
+ by John <rowe@excc.ex.ac.uk>:
+ http://lists.w3.org/Archives/Public/www-validator/2001JanMar/0261.html
+
+ Had to tweak it to make the combination of -r and -D work.
+
+2001-03-27 00:39 hugo
+
+ * checklink.pl: Maybe the base element problem is going to go away
+ completely now. I have added a change by John
+ <rowe@excc.ex.ac.uk> which basically does not trust
+ HTTP::Response's base function, and it seems to be the way to go.
+
+2001-01-22 18:00 hugo
+
+ * checklink.pl: Fixed typo
+
+2001-01-22 17:58 hugo
+
+ * checklink.pl: Made it clearer that www-validator is a mailing
+ list to avoid confusion
+
+2001-01-22 17:49 hugo
+
+ * checklink.pl: Now suggests to send bug reports and comments to
+ www-validator@w3.org Stopped suggesting people to send me email
+ when the max number of documents checked is reached in
+ recursive mode.
+
+2001-01-19 15:59 hugo
+
+ * checklink.pl: Changed the text before the input box too
+
+2001-01-19 15:52 hugo
+
+ * checklink.pl: Changed title to "W3C Link Checker" only for the
+ form instead of "W3C® Link Checker: Revision: 2.68 (c) 1999-2001
+ W3C" which was too verbose
+
+2001-01-18 23:58 hugo
+
+ * checklink.pl: Committing changes that I made a long time ago...
+
+ - Added 410 HTTP error code in the list of errors. - Simplified
+ a few regular expressions (got rid of some /'s). - Added support
+ for //... URI's. - Commented a bit more the code because I was
+ getting lost. - Simplified already_processed(). - Added a
+ --hide-same-realm option: hide 401's that are in the same
+ realm as the document checked. - Added a --masquerade option to
+ check documents on the local system as if they were on the
+ Web (based on a patch by David Carlisle <davidc@nag.co.uk>).
+ Example: checklink -q -b --masquerade
+ /afs/w3.org/pub/WWW/Math/Group/draft/cr/
+ http://www.w3.org/TR/2000/CR-MathML2-20001113/ appendixe.html -
+ 500 errors are now split in two cases: DNS problems (LWP reports
+ that as a 500...) and real internal server errors. - Fixed a
+ footer problem in the online problem. - Fixed a duplicate id
+ problem in the error code references.
+
+2000-09-25 20:38 hugo
+
+ * checklink.pl: Undid change done in 2.62 about base tag handling.
+ I'm not really sure why I did this change in the first place, I
+ might have to fix a bug later...
+
+2000-08-17 18:10 hugo
+
+ * checklink.pl: Replaced require by use and specified that
+ HTML::Parser version 3 at least is required.
+
+2000-08-15 21:03 hugo
+
+ * checklink.pl: Reverting line list code because it needs tweaking
+
+2000-08-12 00:35 hugo
+
+ * checklink.pl: Got rid of unnecessary verbosity in the line list.
+ Added a valid links message in the text version.
+
+2000-08-05 17:46 hugo
+
+ * checklink.pl: Now checks <blockquote cite="...">
+
+2000-08-05 02:20 hugo
+
+ * checklink.pl: Fixed a confusion in the use of URI::abs()
+
+2000-08-05 01:43 hugo
+
+ * checklink.pl: Do not consider base's href as a link anymore (was
+ wrong and was causing a loop in recursive mode)
+
+2000-08-03 21:38 hugo
+
+ * checklink.pl: Discovered the existence of
+ HTTP::Headers::authorization_basic
+
+2000-07-17 17:26 hugo
+
+ * checklink.pl: Fixed typo
+
+2000-07-14 00:18 hugo
+
+ * checklink.pl: Added links to the documentation
+
+2000-07-13 19:10 hugo
+
+ * checklink.pl: Added a test in the case when the encoding would be
+ identity
+
+2000-07-12 22:14 hugo
+
+ * checklink.pl: Now checks the Content-Encoding too
+
+2000-07-10 18:21 hugo
+
+ * checklink.pl: Fixed a bug due to the migration to the version 3
+ API
+
+2000-06-23 22:09 hugo
+
+ * checklink.pl: Clarified the footer
+
+2000-06-23 19:56 hugo
+
+ * checklink.pl: Increased the limit for the online version to 150
+ documents.
+
+2000-06-12 18:37 hugo
+
+ * checklink.pl: Fixed a bad wording
+
+2000-05-16 16:12 hugo
+
+ * checklink.pl: Fixed a display problem
+
+2000-05-16 15:48 hugo
+
+ * checklink.pl: Clean up. Fixed a tiny bug.
+
+2000-05-11 18:44 hugo
+
+ * checklink.pl: Cosmetic changes
+
+2000-05-10 21:32 hugo
+
+ * checklink.pl: Speeded up parsing. Fixed display problem in the
+ recursive version. Made things simpler in check_uri().
+
+2000-05-09 22:44 hugo
+
+ * checklink.pl: Next time, I'll think twice before committing: -
+ Removed superfluous test - Using the same strategy for text
+ version
+
+2000-05-09 22:36 hugo
+
+ * checklink.pl: Report fragments as really broken is the HTTP end
+ code is 200
+
+2000-05-07 18:16 hugo
+
+ * checklink.pl: Removed unnecessary test. Added a header for the
+ link results.
+
+2000-05-05 18:13 hugo
+
+ * checklink.pl: Improved presentation for 500's
+
+2000-05-05 02:33 hugo
+
+ * checklink.pl: Improved presentation for redirected broken URI's.
+ Added note about directory redirects.
+
+2000-05-05 01:58 hugo
+
+ * checklink.pl: Added anchor to the results for better
+ accessibility. Removed a lot of colors: it's more readable.
+ Added a heading for anchors report.
+
+2000-05-05 01:29 hugo
+
+ * checklink.pl: Now uses the version 3 of the API of HTML::Parser.
+ Got rid of the chunk stuff, since the new parser doesn't care
+ about the size of the data parsed (good). Added attributes
+ checked. Still not perfect, but better. Fixed a display bug in
+ the text version. Now parses <?xml:stylesheet ...?>. I'm not
+ happy about the way it's done though.
+
+2000-05-04 23:28 hugo
+
+ * checklink.pl: Changed language from 'en' to '*'. Fixed a bug in
+ the online version when the document cannot be processed.
+ Changed the presentation to make clearer what needs to be done.
+ Changed the directions to fix the links.
+
+2000-04-26 20:07 hugo
+
+ * checklink.pl: Only use XML extensions when required in the parser
+
+2000-04-11 01:10 hugo
+
+ * checklink.pl: Fixed a bug when accessing a file which does not
+ exist. Now counts the lines in <? ... ?> parts. Now reports
+ when an anchor is defined twice on the same line.
+
+2000-04-06 00:28 hugo
+
+ * checklink.pl: Changed the format of the HTML output to make it
+ accessible. Broken links sorted by priority. Fixed a display
+ issue for 200 replies. Removed useless statistics and cleaned up
+ a few things. Changed the email address to report configuration
+ problems. Added links to HTML Validator and CSS Validator.
+ Added optional Accept-Language headers.
+
+2000-03-30 01:31 hugo
+
+ * checklink.pl: Better handling of doctypes.
+
+2000-03-20 22:55 hugo
+
+ * checklink.pl: I knew I would break something changing the data
+ structures: I love it when a plan comes together! Fixed a table
+ display problem.
+
+2000-03-02 01:15 hugo
+
+ * checklink.pl: Fixed authentication problem. Added a limit for
+ the number of documents checked with the online version. Fixed
+ the -h command line option
+
+2000-02-25 00:22 hugo
+
+ * checklink.pl: Major changes: - can check documents recursively -
+ fixed a bug in the parsing of the URI in the CGI version - now
+ checks URI's for DTD's - now checks id attributes - doesn't check
+ <a name="..."> for XHTML Basic 1.0
+
+2000-02-18 00:56 hugo
+
+ * checklink.pl: Changed the way results are stored internally Added
+ an option to hide directory redirects: http://www.w3.org/TR ->
+ http://www.w3.org/TR/ Fixed some stats errors Fixed some display
+ problems Separated directory redirects in the report
+
+2000-02-11 01:03 hugo
+
+ * checklink.pl: Fixed nasty bug: when providing authentication
+ information, 401's were not reported.
+
+2000-02-09 17:41 hugo
+
+ * checklink.pl: Fixed markup problem (thanks to Patrick D. F. Ion
+ <ion@ams.org>)
+
+2000-02-09 01:11 hugo
+
+ * checklink.pl: Changed the default for HTML version to "Show
+ redirects"
+
+2000-02-08 23:48 hugo
+
+ * checklink.pl: "use strict;" which had somehow disappeared is
+ back. It helped me fixing a typo.
+
+2000-02-08 23:19 hugo
+
+ * checklink.pl: Added comments Now correctly reports broken
+ fragments with redirects
+
+2000-02-03 20:30 hugo
+
+ * checklink.pl: Now handling empty anchors. Worked on the command
+ line version. Added link to source code.
+
+2000-01-28 23:19 hugo
+
+ * checklink.pl: Added links to online versions
+
+2000-01-27 01:04 hugo
+
+ * checklink.pl: Added W3C logo
+
+2000-01-27 00:29 hugo
+
+ * checklink.pl: W3C is registered
+
+2000-01-27 00:24 hugo
+
+ * checklink.pl: Encoding HTML correctly
+
+2000-01-26 23:45 hugo
+
+ * checklink.pl: Fixed HTTP answer
+
+2000-01-21 01:00 hugo
+
+ * checklink.pl: Some more explanation
+
+2000-01-21 00:48 hugo
+
+ * checklink.pl: Fixed problem in the summary table
+
+2000-01-21 00:01 hugo
+
+ * checklink.pl: Backward compatibility with Renaud's script
+
+2000-01-20 23:59 hugo
+
+ * checklink.pl: Fixed escaped URI's problem
+
+2000-01-20 23:40 hugo
+
+ * checklink.pl: Fixed line count
+
+2000-01-18 00:16 hugo
+
+ * checklink.pl: Fixed copyright
+
+2000-01-18 00:16 hugo
+
+ * checklink.pl: Added summary and advices.
+
+2000-01-13 19:32 hugo
+
+ * checklink.pl: Now validates
+
+2000-01-13 19:27 hugo
+
+ * checklink.pl: Fixed form
+
+1999-12-15 20:03 hugo
+
+ * checklink.pl: Added authetication in the command line version.
+ Fixed a few bugs. Added proxy support
+
+1999-12-14 17:20 hugo
+
+ * checklink.pl: Fixed problem with page tested in report. Added
+ cache control headers.
+
+1999-12-05 02:31 hugo
+
+ * checklink.pl: Line count is still flaky but the CGI version does
+ what is expected.
+
+1999-12-05 01:34 hugo
+
+ * checklink.pl: Fixed redirect + 401 problem
+
+1999-12-05 01:19 hugo
+
+ * checklink.pl: More detailed output
+
+1999-12-05 01:00 hugo
+
+ * checklink.pl: Fixed problems with 401's + a few other bugs
+
+1999-12-02 03:42 hugo
+
+ * checklink.pl: Supports authentication with the CGI interface &
+ output is more accurate
+
+1999-12-01 23:52 hugo
+
+ * checklink.pl: CGI interface improved
+
+1999-12-01 22:40 hugo
+
+ * checklink.pl: Fixed various CGI interface problems
+
+1999-12-01 21:53 hugo
+
+ * checklink.pl: CVS messed up a regular expression... fixed
+
+1999-12-01 21:47 hugo
+
+ * checklink.pl: Fixed display bugs. Line count still flaky.
+
+1999-11-26 15:14 hugo
+
+ * checklink.pl: Fixed line count. Parsing speed up.
+
+1999-11-25 00:04 hugo
+
+ * checklink.pl: new version of checklink
+
+1999-05-05 09:43 renaudb
+
+ * checklink.pl: split request thing before registering in LWP UA
+
+1999-04-28 22:31 renaudb
+
+ * checklink.pl: added plenty of color codes for errors
+
+1999-03-09 00:34 renaudb
+
+ * checklink.pl: added links to CVS server
+
+1999-03-09 00:19 renaudb
+
+ * checklink.pl: now HTTP return code 500 is considered as a broken
+ link, and is highlighted using the color red
+
+1999-03-02 00:35 renaudb
+
+ * checklink.pl: modified front end
+
+1999-03-01 22:09 renaudb
+
+ * checklink.pl: added comment about prerequesite, and testing write
+ access to CVS base
+
+1999-02-21 02:25 renaudb
+
+ * checklink.pl: put more comments in the code
+
+1999-02-21 02:13 renaudb
+
+ * checklink.pl: made checklink handle 401s the right way...yoohoo
+
+1999-02-19 04:22 renaudb
+
+ * checklink.pl: added color and legend for 403s
+
+1998-11-13 03:10 renaudb
+
+ * checklink.pl: added more verbose output
+
+1998-09-28 22:24 renaudb
+
+ * checklink.pl: fixed colors on redirect and typo in title
+
+1998-09-24 03:24 renaudb
+
+ * checklink.pl: added legend
+
+1998-09-12 02:05 renaudb
+
+ * checklink.pl: fixed weird CGI error
+
+1998-09-12 00:34 renaudb
+
+ * checklink.pl: finished handling of 401s. fixed double responses
+ too
+
+1998-09-10 03:33 renaudb
+
+ * checklink.pl: 302 handled
+
+1998-09-09 23:38 renaudb
+
+ * checklink.pl: added flag for text in links
+
+1998-09-09 16:43 renaudb
+
+ * checklink.pl: added few comments
+
+1998-09-06 04:00 renaudb
+
+ * checklink.pl: add use strict and -T. almost working
+
+1998-09-05 03:19 renaudb
+
+ * checklink.pl: added smarts for redirects
+
+1998-09-04 19:12 renaudb
+
+ * checklink.pl: changed auth method fom GET to POST. yeeek.
+
+1998-09-04 04:21 renaudb
+
+ * checklink.pl: nope
+
+1998-09-03 21:34 renaudb
+
+ * checklink.pl: added version, link to original page, changing
+ title
+
+1998-09-03 03:20 renaudb
+
+ * checklink.pl: fixed auth
+
+1998-09-02 03:05 renaudb
+
+ * checklink.pl: fixed couple bugs. auth no worky
+
+1998-08-31 23:54 renaudb
+
+ * checklink.pl: added line numbers. still buggy
+
+1998-08-31 19:16 renaudb
+
+ * checklink.pl: added no-buffering
+
+1998-08-31 18:55 renaudb
+
+ * checklink.pl: added version tag
+
+1998-08-31 18:53 renaudb
+
+ * checklink.pl: first commit of team checklink
+
diff -r 55210bc792ab -r ddeae812731c MANIFEST
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/MANIFEST Thu Mar 18 21:44:14 2004 +0000
@@ -0,0 +1,8 @@
+ChangeLog Recent changes
+ChangeLog.old Older changes
+Makefile.PL
+MANIFEST
+README Start by reading this
+bin/checklink The link checker
+bin/checklink.pod Manual page for the link checker
+META.yml
diff -r 55210bc792ab -r ddeae812731c META.yml
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/META.yml Thu Mar 18 21:44:14 2004 +0000
@@ -0,0 +1,18 @@
+# http://module-build.sourceforge.net/META-spec.html
+#XXXXXXX This is a prototype!!! It will change in the future!!! XXXXX#
+name: W3C-LinkChecker
+version: 3.9.1
+version_from: bin/checklink
+installdirs: site
+requires:
+ CGI: 0
+ Config::General: 2.06
+ HTML::Parser: 3
+ LWP::UserAgent: 0
+ Net::IP: 0
+ Term::ReadKey: 2
+ Time::HiRes: 0
+ URI: 0
+
+distribution_type: module
+generated_by: ExtUtils::MakeMaker version 6.17
diff -r 55210bc792ab -r ddeae812731c Makefile.PL
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/Makefile.PL Thu Mar 18 21:44:14 2004 +0000
@@ -0,0 +1,23 @@
+use 5.00503;
+use ExtUtils::MakeMaker;
+
+WriteMakefile(
+ NAME => 'W3C::LinkChecker',
+ ABSTRACT => 'W3C Link Checker',
+ AUTHOR => 'The W3C Link Checker Team <www-validator@w3.org>',
+ VERSION_FROM => 'bin/checklink',
+ PREREQ_PM => {
+ CGI => 0,
+ Config::General => 2.06,
+ HTML::Parser => 3.00,
+ LWP::UserAgent => 0,
+ Net::IP => 0,
+ Term::ReadKey => 2.00,
+ Time::HiRes => 0,
+ URI => 0,
+ },
+ EXE_FILES => [ 'bin/checklink' ],
+ MAN1PODS => { 'bin/checklink.pod' =>
+ '$(INST_MAN1DIR)/checklink.$(MAN1EXT)',
+ },
+);
diff -r 55210bc792ab -r ddeae812731c README
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/README Thu Mar 18 21:44:14 2004 +0000
@@ -0,0 +1,44 @@
+W3C-LinkChecker
+===============
+
+This distribution contains the W3C Link Checker.
+
+The link checker can be run as a CGI script in a web server as well as
+on the command line. The CGI version provides a HTML interface as
+seen at <http://validator.w3.org/checklink>.
+
+To install the distribution for command line use:
+
+ perl Makefile.PL
+ make
+ make test
+ make install # as root
+
+To install the CGI version, in addition to the above, copy the
+bin/checklink script into a location in your web server from where
+execution of CGI scripts is allowed, and make sure that the web server
+user has execute permissions to the script. The CGI directory is
+typically named "cgi-bin" somewhere under your web server root
+directory.
+
+For more information, please consult the POD documentation in the
+checklink.pod file, typically (in the directory where you unpacked the
+source):
+
+ perldoc ./bin/checklink.pod
+
+COPYRIGHT AND LICENCE
+
+Written by the following people for the W3C:
+- Hugo Haas <hugo@w3.org>
+- Ville Skyttä <ville.skytta@iki.fi>
+- The W3C Link Checker Team <www-validator@w3.org>
+
+Copyright (C) 1994-2004 World Wide Web Consortium, (Massachusetts
+Institute of Technology, European Research Consortium for Informatics
+and Mathematics, Keio University). All Rights Reserved. This work is
+distributed under the W3C(R) Software License [1] in the hope that it
+will be useful, but WITHOUT ANY WARRANTY; without even the implied
+warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
+
+[1] http://www.w3.org/Consortium/Legal/copyright-software
diff -r 55210bc792ab -r ddeae812731c bin/checklink.pod
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/bin/checklink.pod Thu Mar 18 21:44:14 2004 +0000
@@ -0,0 +1,192 @@
+$Id: checklink.pod,v 1.1 2004-03-18 21:44:14 ville Exp $
+
+=head1 NAME
+
+checklink - check the validity of links in an HTML or XHTML document
+
+=head1 SYNOPSIS
+
+B<checklink> [ I<options> ] I<URI> ...
+
+=head1 DESCRIPTION
+
+This manual page documents briefly the B<checklink> command.
+
+B<checklink> is a program that reads an HTML or XHTML document,
+extracts a list of anchors and lists and checks that no anchor is
+defined twice and that all the links are dereferenceable, including
+the fragments. It warns about HTTP redirects, including directory
+redirects, and can check recursively a part of a web site.
+
+The program can be used either as a command-line version or as a CGI script.
+
+=head1 OPTIONS
+
+This program follow the usual GNU command line syntax, with long options
+starting with two dashes (`-'). A summary of options is included below.
+
+=over 5
+
+=item B<-?, --help>
+
+Show summary of options.
+
+=item B<-V, --version>
+
+Output version information.
+
+=item B<-s, --summary>
+
+Result summary only.
+
+=item B<-b, --broken>
+
+Show only the broken links, not the redirects.
+
+=item B<-e, --directory>
+
+Hide directory redirects - e.g. L<http://www.w3.org/TR> ->
+L<http://www.w3.org/TR/>.
+
+=item B<-r, --recursive>
+
+Check the documents linked from the first one.
+
+=item B<-D, --depth n>
+
+Check the documents linked from the first one to depth n
+(implies --recursive).
+
+=item B<-l, --location uri>
+
+Scope of the documents checked in recursive mode. By default, for
+L<http://www.w3.org/TR/html4/Overview.html> for example, it would be
+L<http://www.w3.org/TR/html4/>.
+
+=item B<-n, --noacclanguage>
+
+Do not send an Accept-Language header.
+
+=item B<-L, --languages>
+
+Languages accepted (default: '*').
+
+=item B<-q, --quiet>
+
+No output if no errors are found.
+
+=item B<-v, --verbose>
+
+Verbose mode.
+
+=item B<-i, --indicator>
+
+Show progress while parsing.
+
+=item B<-u, --user username>
+
+Specify a username for authentication.
+
+=item B<-p, --password password>
+
+Specify a password.
+
+=item B<--hide-same-realm>
+
+Hide 401's that are in the same realm as the document checked.
+
+=item B<-t, --timeout value>
+
+Timeout for the HTTP requests.
+
+=item B<-d, --domain domain>
+
+Regular expression describing the domain to which the authentication
+information will be sent. The default value can be specified in the
+checklink configuration file.
+
+=item B<--masquerade "local remote">
+
+Masquerade local dir as a remote URI. For example, the following results in
+/my/local/dir/ being "mapped" to http://some/remote/uri/
+
+ --masquerade "/my/local/dir http://some/remote/uri/"
+
+As of revision 3.6.2.19 of checklink, --masquerade takes a single
+argument consisting of two URIs, separated by whitespace. One usual way of
+providing a value with embedded whitespace is to enclose it in quotes.
+
+=item B<-y, --proxy proxy>
+
+Specify an HTTP proxy server.
+
+=item B<-h, --html>
+
+HTML output.
+
+=back
+
+=head1 FILES
+
+=over 5
+
+=item F</etc/w3c/checklink.conf>
+
+The main configuration file. You can use the L<W3C_CHECKLINK_CFG> environment
+variable to override the default location.
+
+C<Trusted> specifies a regular expression for
+matching trusted domains (ie. domains where HTTP basic authentication, if
+any, will be sent). For example, the following configures only the w3.org
+domain as trusted:
+
+ Trusted = \.w3\.org$
+
+C<Allow_Private_IPs> is a boolean flag indicating whether checking links
+on non-public IP addresses is allowed. The default is true in command line
+mode and false when run as a CGI script. For example, to disallow checking
+non-public IP addresses, regardless of the mode, use:
+
+ Allow_Private_IPs = 0
+
+=back
+
+=head1 ENVIRONMENT
+
+checklink uses the libwww-perl library which has a number of environment
+variables affecting its behaviour. See L</"SEE ALSO"> for some
+pointers.
+
+=over 5
+
+=item B<W3C_CHECKLINK_CFG>
+
+If set, overrides the path to the configuration file.
+
+=back
+
+=head1 SEE ALSO
+
+The documentation for this program is available on the web at
+L<http://www.w3.org/2000/07/checklink>.
+
+LWP(3), L<Net::FTP(3)>, L<Net::NNTP(3)>, L<Net::IP>.
+
+=head1 AUTHOR
+
+This program was originally written by Hugo Haas <hugo@w3.org>, based on
+Renaud Bruyeron's F<checklink.pl>. It has been enhanced by Ville SkyttE<228>
+and many other volunteers since. Use the <www-validator@w3.org> mailing list
+for feedback, see L<http://validator.w3.org/docs/checklink.html#csb> for more
+information.
+
+This manual page was written by FrE<233>dE<233>ric SchE<252>tz
+<schutz@mathgen.ch>, for the Debian GNU/Linux system (but may be used by
+others).
+
+=head1 COPYRIGHT
+
+This program is licensed under the W3CE<174> Software License,
+L<http://www.w3.org/Consortium/Legal/copyright-software>.
+
+=cut
Received on Thursday, 5 August 2010 14:46:55 UTC