- From: Mercurial notifier <nobody@w3.org>
- Date: Thu, 05 Aug 2010 14:46:52 +0000
- To: link-checker updates <www-validator-cvs@w3.org>
changeset: 2:ddeae812731c tag: import-from-validator user: ville date: Thu Mar 18 21:44:14 2004 +0000 files: ChangeLog ChangeLog.old MANIFEST META.yml Makefile.PL README bin/checklink.pod description: First CPANified version, copied here from the validator dir. diff -r 55210bc792ab -r ddeae812731c ChangeLog --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/ChangeLog Thu Mar 18 21:44:14 2004 +0000 @@ -0,0 +1,3 @@ +2004-02-18 Ville Skyttä <ville.skytta@iki.fi> + + * First CPAN version. diff -r 55210bc792ab -r ddeae812731c ChangeLog.old --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/ChangeLog.old Thu Mar 18 21:44:14 2004 +0000 @@ -0,0 +1,856 @@ +2004-02-01 13:07 ville + + * checklink.pl: 2004. + +2003-11-25 22:46 ville + + * checklink.pl: Markup improvements: "fussy"-clean, towards HTML + 4.01 Strict and ISO-HTML. + +2003-11-24 21:11 ville + + * checklink.pl: Fix HTML "escaping" of URIs. Some were escaped + twice, some not at all. + +2003-11-22 17:40 ville + + * checklink.pl: Check recursion scope also for redirected + documents. + http://lists.w3.org/Archives/Public/www-validator/2003Nov/0074.html + +2003-11-22 17:20 ville + + * checklink.pl: Improve "To do" message wrapping in command line + mode. + +2003-11-22 17:17 ville + + * checklink.pl: Fix URI->abs() usage in absolutizing URIs. + +2003-10-20 23:19 ville + + * checklink.pl: Make ip_allowed() not choke on URIs that don't + implement host(). + http://lists.w3.org/Archives/Public/www-validator/2003Oct/0056.html + +2003-09-15 21:31 ville + + * checklink.pl: Unbreak --masquerade, enhance documentation and + error messages. W3C Bugzilla #340, Debian bug #187791, thanks to + Dan Jacobson and Frederic Schutz for the report. + +2003-09-12 20:49 ville + + * checklink.pl: Make the configuration file optional. Thanks to + Hugo Haas for the heads up. Also, load Config::General only if + needed. 
+ +2003-09-11 13:46 hugo + + * checklink.pl: Fixing the following: [Thu Sep 11 12:44:53 2003] + checklink.pl: Argument "help" isn't numeric in exit at + /home/hugo/bin/checklink.pl line 410. + +2003-07-27 19:19 ville + + * checklink.pl: Don't conditionalize the Term::ReadKey requirement, + but defer its loading until actually needed. system('stty') + doesn't work under -T without laundering $ENV{PATH}, and I don't + feel like going there :P + + Also, don't load CGI and CGI::Carp in command line mode; defer by + using require() instead of use(). + +2003-07-27 18:38 ville + + * checklink.pl: Implement option to disallow checking non-public IP + addresses; disallow by default in CGI mode and allow in command + line mode. + + New requirement: Net::IP. + +2003-07-26 22:20 ville + + * checklink.pl: Use Text::Wrap for beautifying command line output, + some other output tweaks. + +2003-07-26 22:16 ville + + * checklink.pl: Fix redirect loop detection, and add a descriptive + error message. + + Previously, only loops like L1 -> L2 -> L3 -> L1 were detected; + ones like L1 -> L2 -> L3 -> L2 resulted in infinite recursion. + + Fixes <http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=188372>, + thanks to Dan Jacobson and Frederic Schutz for reporting this. + +2003-07-26 18:48 ville + + * checklink.pl: Plug myself into the man page's AUTHOR section, per + Frederic's request :) + +2003-07-23 23:08 ville + + * checklink.pl: Include POD documentation, courtesy of Frederic + Schutz. + +2003-06-16 00:41 ville + + * checklink.pl: Document env vars affecting FTP configuration, and + make passive mode the default (#226). + +2003-06-15 17:56 ville + + * checklink.pl: Trust and launder the NNTPSERVER and NEWSHOST + environment variables in order to support news: URIs without + explicit hostnames under -T. + +2003-06-15 17:42 ville + + * checklink.pl: Call get_timestamp always without arguments. + +2003-06-02 20:27 ville + + * checklink.pl: -? 
== --help, be a bit more verbose with the usage + of --location and --recursive (Debian bug report #189642). + +2003-05-24 23:32 link + + * checklink.pl: Merging from validator-0_6_0-branch, at tag + validator-0_6_2-beta1, to HEAD. + +2003-04-19 22:50 ville + + * checklink.pl: - Abort check in case of non-understood command + line options (#203, Debian bug #187792). - Allow bundling of + command line options. - Bump 2002->2003 in copyright. + +2003-02-05 23:47 ville + + * checklink.pl: Skip 'base' tags with empty hrefs (#138). Put HTML + validity a bit more prominently into beginning of results page. + +2003-02-02 21:03 ville + + * checklink.pl: - Implemented external configuration file stuff + (#127). We now require Config::General >= 2.06. - Fixed basic + authentication (#122). If the trusted domain is not specificed, + we only send the authentication to the same host which requested + it. Also added a notice about this to the error page/command + line message if no suitable authentication info was given. - + Changed text/javascript to application/x-javascript, thanks to + Nicolas Roeser for the report. (see + <http://lists.w3.org/Archives/Public/www-validator/2003Jan/0158.html>) + - Fixed missing results from the summary when a file:// URI was + present (also reduces number of error log warnings emitted). - + Command line --help clarifications, reformatting. - Code + cleanups. + +2003-01-07 22:07 ville + + * checklink.pl: Fix markup in "Authorization Required" error + document. + +2002-12-09 02:16 ville + + * checklink.pl: - Fix multiple brainos in cookie code. + +2002-12-08 16:47 ville + + * checklink.pl: - Rewrote recursion scope checking, was failing + with URIs with no slash after the hostname; eg. + <http://validator.w3.org> caused checklink to wander to other + sites (#115). - Report CVS revision as version number. 
+ +2002-12-07 21:55 ville + + * checklink.pl: - Send Accept: application/xhtml+xml, text/html, + */*;q=0.5 (#112) - Sanitize Accept-Language headers; either send + the ones from a browser or *. In command line mode, really + default to * and send the ones given with the command line + option. (#110) - Add messages for codes 303 (#111), 307 and 406 + (#55). - Add "for" attribute for all <label>'s and the + corresponding id's so that MSIE users can enjoy clicking the + labels. - Some internal cleanup. + +2002-11-23 23:37 ville + + * checklink.pl: - Options can be saved in a cookie (#56). - Added + a JavaScript check against submitting empty URIs. - Style + tweaks, ala Validator. - Point to new docs location (relative). + +2002-11-23 17:06 ville + + * checklink.pl: - Tweak "form page" layout. - Don't say "Valid + anchors!" if no anchors found. - Fix HTML output from command + line. + +2002-11-22 01:27 ville + + * checklink.pl: Output and style nits. + +2002-10-27 15:04 ville + + * checklink.pl: Removed the debug logic and reworked the guess + whether the script is being run as a CGI or from the command + line; the problems this caused outweigh its usefulness [1]. Oh, + and this fixes #26. + + [1] <http://nagoya.apache.org/bugzilla/show_bug.cgi?id=13914> + +2002-10-27 14:08 ville + + * checklink.pl: Unbreak the source download link (introduced in my + previous commit). Found by checking the output of checklink with + itself. Yay :) + +2002-10-27 12:55 ville + + * checklink.pl: - Load Getopt::Long only in command line mode. - + Tighter lexical scoping for variables. - Output version/revision + number consistently everywhere. + +2002-10-27 01:43 ville + + * checklink.pl: Use keep-alive if the installed LWP is >= 5.6. + +2002-10-27 00:49 ville + + * checklink.pl: - Made W3C::UserAgent more OO avoiding use of + W3C::CheckLink's lexicals. - Tuned HTTP header sending to make + mod_perl 1.99_05 happy. - Some style tweaks; reorganized + use()'s, added missing ones. 
+ +2002-10-26 22:19 ville + + * checklink.pl: Swap $REVISION and $VERSION in order to be more + consistent with Perl stuff in general, make sure that we have at + least two digits in the version number. + +2002-10-26 22:04 ville + + * checklink.pl: Whitespace only (reindentation). + +2002-10-26 21:29 ville + + * checklink.pl: Portability update, now actually works on Win2K. + + Term::ReadKey is now used for reading the password from the + command line if it is available, falling back to stty if it + isn't. + +2002-10-23 23:31 ville + + * checklink.pl: Fix all function prototypes, and remove them from + methods. + +2002-10-23 07:32 hugo + + * checklink.pl: Security fix: cross-scripting vulnerability found + and fixed by ITO Tsuyoshi <tsuyoshi@is.s.u-tokyo.ac.jp>. + +2002-08-22 04:03 link + + * checklink.pl: Applying checklink-options.patch from Ville SKytta. + Thanks Ville! Use GetOpt::Long for command-line option handling. + Add &version/-V. + +2002-08-22 03:56 link + + * checklink.pl: Applying checklink-cleanup.patch from Ville SKytta. + Thanks Ville! (A bunch of cleanups; police whitespace, namespace, + HTML, etc.) + +2002-08-20 04:51 link + + * checklink.pl: Merging to HEAD from validator-0_6_0 branch. This + will be validator-0_6_0b1c1 (Beta 1, Candidate 1). + +2002-07-08 00:54 link + + * checklink.pl: Apply "Local Validator" patch from Ville Skytta. + Thanks Ville! + +2002-06-21 23:29 eric + + * checklink.pl: s/netloc/host/ to reflect API change. made it easy + to rerun HTTP sessions in the debugger. covering for that + slacker Hugo. + +2002-02-01 23:29 hugo + + * checklink.pl: Updated copyright info + +2002-01-11 14:36 hugo + + * checklink.pl: Added support for application/xhtml+xml documents + +2001-11-26 20:18 hugo + + * checklink.pl: Applied patch from Henk P. 
Penning + <henkp@cs.uu.nl>: Using Time::HiRes instead of sys/syscall.ph + (should be more portable) + + See: + http://lists.w3.org/Archives/Public/www-validator/2001OctDec/0224.html + +2001-09-21 17:15 hugo + + * checklink.pl: Tiny change: escaping less characters in URIs (0-9 + and . are no longer escaped) + +2001-09-17 23:23 hugo + + * checklink.pl: Fixed a URI escaping problem + +2001-08-01 14:53 hugo + + * checklink.pl: Added support for *_proxy environment variables + +2001-07-25 14:34 hugo + + * checklink.pl: Specifying the charset (Latin-1) + +2001-06-27 18:59 hugo + + * checklink.pl: Apllied patch from Terje Bless <link@tss.no>: + http://lists.w3.org/Archives/Public/www-validator/2001AprJun/0496.html + +2001-06-05 13:09 gerald + + * checklink.pl: typo + +2001-04-04 22:18 hugo + + * checklink.pl: Ooops, left a closing tag... + +2001-04-04 22:16 hugo + + * checklink.pl: Fixed an HTML validity problem in the summary only + mode: + + http://lists.w3.org/Archives/Public/www-validator/2001AprJun/0001.html + +2001-03-27 15:58 hugo + + * checklink.pl: Added directions for inaccessible javascript: URIs + +2001-03-27 02:12 hugo + + * checklink.pl: Got rid of $_contact_address Changed the mailto: + URIs and instructions to include 'checklink' in the subject when + sending a mail to www-validator + +2001-03-27 01:46 hugo + + * checklink.pl: Made the "Summary only" function do the same as + John <rowe@excc.ex.ac.uk>'s quiet function from: + http://lists.w3.org/Archives/Public/www-validator/2001JanMar/0261.html + This is a tiny change. + +2001-03-27 01:33 hugo + + * checklink.pl: Added a --depth option based on the patch provided + by John <rowe@excc.ex.ac.uk>: + http://lists.w3.org/Archives/Public/www-validator/2001JanMar/0261.html + + Had to tweak it to make the combination of -r and -D work. + +2001-03-27 00:39 hugo + + * checklink.pl: Maybe the base element problem is going to go away + completely now. 
I have added a change by John + <rowe@excc.ex.ac.uk> which basically does not trust + HTTP::Response's base function, and it seems to be the way to go. + +2001-01-22 18:00 hugo + + * checklink.pl: Fixed typo + +2001-01-22 17:58 hugo + + * checklink.pl: Made it clearer that www-validator is a mailing + list to avoid confusion + +2001-01-22 17:49 hugo + + * checklink.pl: Now suggests to send bug reports and comments to + www-validator@w3.org Stopped suggesting people to send me email + when the max number of documents checked is reached in + recursive mode. + +2001-01-19 15:59 hugo + + * checklink.pl: Changed the text before the input box too + +2001-01-19 15:52 hugo + + * checklink.pl: Changed title to "W3C Link Checker" only for the + form instead of "W3C® Link Checker: Revision: 2.68 (c) 1999-2001 + W3C" which was too verbose + +2001-01-18 23:58 hugo + + * checklink.pl: Committing changes that I made a long time ago... + + - Added 410 HTTP error code in the list of errors. - Simplified + a few regular expressions (got rid of some /'s). - Added support + for //... URI's. - Commented a bit more the code because I was + getting lost. - Simplified already_processed(). - Added a + --hide-same-realm option: hide 401's that are in the same + realm as the document checked. - Added a --masquerade option to + check documents on the local system as if they were on the + Web (based on a patch by David Carlisle <davidc@nag.co.uk>). + Example: checklink -q -b --masquerade + /afs/w3.org/pub/WWW/Math/Group/draft/cr/ + http://www.w3.org/TR/2000/CR-MathML2-20001113/ appendixe.html - + 500 errors are now split in two cases: DNS problems (LWP reports + that as a 500...) and real internal server errors. - Fixed a + footer problem in the online problem. - Fixed a duplicate id + problem in the error code references. + +2000-09-25 20:38 hugo + + * checklink.pl: Undid change done in 2.62 about base tag handling. 
+ I'm not really sure why I did this change in the first place, I + might have to fix a bug later... + +2000-08-17 18:10 hugo + + * checklink.pl: Replaced require by use and specified that + HTML::Parser version 3 at least is required. + +2000-08-15 21:03 hugo + + * checklink.pl: Reverting line list code because it needs tweaking + +2000-08-12 00:35 hugo + + * checklink.pl: Got rid of unnecessary verbosity in the line list. + Added a valid links message in the text version. + +2000-08-05 17:46 hugo + + * checklink.pl: Now checks <blockquote cite="..."> + +2000-08-05 02:20 hugo + + * checklink.pl: Fixed a confusion in the use of URI::abs() + +2000-08-05 01:43 hugo + + * checklink.pl: Do not consider base's href as a link anymore (was + wrong and was causing a loop in recursive mode) + +2000-08-03 21:38 hugo + + * checklink.pl: Discovered the existence of + HTTP::Headers::authorization_basic + +2000-07-17 17:26 hugo + + * checklink.pl: Fixed typo + +2000-07-14 00:18 hugo + + * checklink.pl: Added links to the documentation + +2000-07-13 19:10 hugo + + * checklink.pl: Added a test in the case when the encoding would be + identity + +2000-07-12 22:14 hugo + + * checklink.pl: Now checks the Content-Encoding too + +2000-07-10 18:21 hugo + + * checklink.pl: Fixed a bug due to the migration to the version 3 + API + +2000-06-23 22:09 hugo + + * checklink.pl: Clarified the footer + +2000-06-23 19:56 hugo + + * checklink.pl: Increased the limit for the online version to 150 + documents. + +2000-06-12 18:37 hugo + + * checklink.pl: Fixed a bad wording + +2000-05-16 16:12 hugo + + * checklink.pl: Fixed a display problem + +2000-05-16 15:48 hugo + + * checklink.pl: Clean up. Fixed a tiny bug. + +2000-05-11 18:44 hugo + + * checklink.pl: Cosmetic changes + +2000-05-10 21:32 hugo + + * checklink.pl: Speeded up parsing. Fixed display problem in the + recursive version. Made things simpler in check_uri(). 
+ +2000-05-09 22:44 hugo + + * checklink.pl: Next time, I'll think twice before committing: - + Removed superfluous test - Using the same strategy for text + version + +2000-05-09 22:36 hugo + + * checklink.pl: Report fragments as really broken is the HTTP end + code is 200 + +2000-05-07 18:16 hugo + + * checklink.pl: Removed unnecessary test. Added a header for the + link results. + +2000-05-05 18:13 hugo + + * checklink.pl: Improved presentation for 500's + +2000-05-05 02:33 hugo + + * checklink.pl: Improved presentation for redirected broken URI's. + Added note about directory redirects. + +2000-05-05 01:58 hugo + + * checklink.pl: Added anchor to the results for better + accessibility. Removed a lot of colors: it's more readable. + Added a heading for anchors report. + +2000-05-05 01:29 hugo + + * checklink.pl: Now uses the version 3 of the API of HTML::Parser. + Got rid of the chunk stuff, since the new parser doesn't care + about the size of the data parsed (good). Added attributes + checked. Still not perfect, but better. Fixed a display bug in + the text version. Now parses <?xml:stylesheet ...?>. I'm not + happy about the way it's done though. + +2000-05-04 23:28 hugo + + * checklink.pl: Changed language from 'en' to '*'. Fixed a bug in + the online version when the document cannot be processed. + Changed the presentation to make clearer what needs to be done. + Changed the directions to fix the links. + +2000-04-26 20:07 hugo + + * checklink.pl: Only use XML extensions when required in the parser + +2000-04-11 01:10 hugo + + * checklink.pl: Fixed a bug when accessing a file which does not + exist. Now counts the lines in <? ... ?> parts. Now reports + when an anchor is defined twice on the same line. + +2000-04-06 00:28 hugo + + * checklink.pl: Changed the format of the HTML output to make it + accessible. Broken links sorted by priority. Fixed a display + issue for 200 replies. Removed useless statistics and cleaned up + a few things. 
Changed the email address to report configuration + problems. Added links to HTML Validator and CSS Validator. + Added optional Accept-Language headers. + +2000-03-30 01:31 hugo + + * checklink.pl: Better handling of doctypes. + +2000-03-20 22:55 hugo + + * checklink.pl: I knew I would break something changing the data + structures: I love it when a plan comes together! Fixed a table + display problem. + +2000-03-02 01:15 hugo + + * checklink.pl: Fixed authentication problem. Added a limit for + the number of documents checked with the online version. Fixed + the -h command line option + +2000-02-25 00:22 hugo + + * checklink.pl: Major changes: - can check documents recursively - + fixed a bug in the parsing of the URI in the CGI version - now + checks URI's for DTD's - now checks id attributes - doesn't check + <a name="..."> for XHTML Basic 1.0 + +2000-02-18 00:56 hugo + + * checklink.pl: Changed the way results are stored internally Added + an option to hide directory redirects: http://www.w3.org/TR -> + http://www.w3.org/TR/ Fixed some stats errors Fixed some display + problems Separated directory redirects in the report + +2000-02-11 01:03 hugo + + * checklink.pl: Fixed nasty bug: when providing authentication + information, 401's were not reported. + +2000-02-09 17:41 hugo + + * checklink.pl: Fixed markup problem (thanks to Patrick D. F. Ion + <ion@ams.org>) + +2000-02-09 01:11 hugo + + * checklink.pl: Changed the default for HTML version to "Show + redirects" + +2000-02-08 23:48 hugo + + * checklink.pl: "use strict;" which had somehow disappeared is + back. It helped me fixing a typo. + +2000-02-08 23:19 hugo + + * checklink.pl: Added comments Now correctly reports broken + fragments with redirects + +2000-02-03 20:30 hugo + + * checklink.pl: Now handling empty anchors. Worked on the command + line version. Added link to source code. 
+ +2000-01-28 23:19 hugo + + * checklink.pl: Added links to online versions + +2000-01-27 01:04 hugo + + * checklink.pl: Added W3C logo + +2000-01-27 00:29 hugo + + * checklink.pl: W3C is registered + +2000-01-27 00:24 hugo + + * checklink.pl: Encoding HTML correctly + +2000-01-26 23:45 hugo + + * checklink.pl: Fixed HTTP answer + +2000-01-21 01:00 hugo + + * checklink.pl: Some more explanation + +2000-01-21 00:48 hugo + + * checklink.pl: Fixed problem in the summary table + +2000-01-21 00:01 hugo + + * checklink.pl: Backward compatibility with Renaud's script + +2000-01-20 23:59 hugo + + * checklink.pl: Fixed escaped URI's problem + +2000-01-20 23:40 hugo + + * checklink.pl: Fixed line count + +2000-01-18 00:16 hugo + + * checklink.pl: Fixed copyright + +2000-01-18 00:16 hugo + + * checklink.pl: Added summary and advices. + +2000-01-13 19:32 hugo + + * checklink.pl: Now validates + +2000-01-13 19:27 hugo + + * checklink.pl: Fixed form + +1999-12-15 20:03 hugo + + * checklink.pl: Added authetication in the command line version. + Fixed a few bugs. Added proxy support + +1999-12-14 17:20 hugo + + * checklink.pl: Fixed problem with page tested in report. Added + cache control headers. + +1999-12-05 02:31 hugo + + * checklink.pl: Line count is still flaky but the CGI version does + what is expected. + +1999-12-05 01:34 hugo + + * checklink.pl: Fixed redirect + 401 problem + +1999-12-05 01:19 hugo + + * checklink.pl: More detailed output + +1999-12-05 01:00 hugo + + * checklink.pl: Fixed problems with 401's + a few other bugs + +1999-12-02 03:42 hugo + + * checklink.pl: Supports authentication with the CGI interface & + output is more accurate + +1999-12-01 23:52 hugo + + * checklink.pl: CGI interface improved + +1999-12-01 22:40 hugo + + * checklink.pl: Fixed various CGI interface problems + +1999-12-01 21:53 hugo + + * checklink.pl: CVS messed up a regular expression... fixed + +1999-12-01 21:47 hugo + + * checklink.pl: Fixed display bugs. Line count still flaky. 
+ +1999-11-26 15:14 hugo + + * checklink.pl: Fixed line count. Parsing speed up. + +1999-11-25 00:04 hugo + + * checklink.pl: new version of checklink + +1999-05-05 09:43 renaudb + + * checklink.pl: split request thing before registering in LWP UA + +1999-04-28 22:31 renaudb + + * checklink.pl: added plenty of color codes for errors + +1999-03-09 00:34 renaudb + + * checklink.pl: added links to CVS server + +1999-03-09 00:19 renaudb + + * checklink.pl: now HTTP return code 500 is considered as a broken + link, and is highlighted using the color red + +1999-03-02 00:35 renaudb + + * checklink.pl: modified front end + +1999-03-01 22:09 renaudb + + * checklink.pl: added comment about prerequesite, and testing write + access to CVS base + +1999-02-21 02:25 renaudb + + * checklink.pl: put more comments in the code + +1999-02-21 02:13 renaudb + + * checklink.pl: made checklink handle 401s the right way...yoohoo + +1999-02-19 04:22 renaudb + + * checklink.pl: added color and legend for 403s + +1998-11-13 03:10 renaudb + + * checklink.pl: added more verbose output + +1998-09-28 22:24 renaudb + + * checklink.pl: fixed colors on redirect and typo in title + +1998-09-24 03:24 renaudb + + * checklink.pl: added legend + +1998-09-12 02:05 renaudb + + * checklink.pl: fixed weird CGI error + +1998-09-12 00:34 renaudb + + * checklink.pl: finished handling of 401s. fixed double responses + too + +1998-09-10 03:33 renaudb + + * checklink.pl: 302 handled + +1998-09-09 23:38 renaudb + + * checklink.pl: added flag for text in links + +1998-09-09 16:43 renaudb + + * checklink.pl: added few comments + +1998-09-06 04:00 renaudb + + * checklink.pl: add use strict and -T. almost working + +1998-09-05 03:19 renaudb + + * checklink.pl: added smarts for redirects + +1998-09-04 19:12 renaudb + + * checklink.pl: changed auth method fom GET to POST. yeeek. 
+ +1998-09-04 04:21 renaudb + + * checklink.pl: nope + +1998-09-03 21:34 renaudb + + * checklink.pl: added version, link to original page, changing + title + +1998-09-03 03:20 renaudb + + * checklink.pl: fixed auth + +1998-09-02 03:05 renaudb + + * checklink.pl: fixed couple bugs. auth no worky + +1998-08-31 23:54 renaudb + + * checklink.pl: added line numbers. still buggy + +1998-08-31 19:16 renaudb + + * checklink.pl: added no-buffering + +1998-08-31 18:55 renaudb + + * checklink.pl: added version tag + +1998-08-31 18:53 renaudb + + * checklink.pl: first commit of team checklink + diff -r 55210bc792ab -r ddeae812731c MANIFEST --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/MANIFEST Thu Mar 18 21:44:14 2004 +0000 @@ -0,0 +1,8 @@ +ChangeLog Recent changes +ChangeLog.old Older changes +Makefile.PL +MANIFEST +README Start by reading this +bin/checklink The link checker +bin/checklink.pod Manual page for the link checker +META.yml diff -r 55210bc792ab -r ddeae812731c META.yml --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/META.yml Thu Mar 18 21:44:14 2004 +0000 @@ -0,0 +1,18 @@ +# http://module-build.sourceforge.net/META-spec.html +#XXXXXXX This is a prototype!!! It will change in the future!!! 
XXXXX# +name: W3C-LinkChecker +version: 3.9.1 +version_from: bin/checklink +installdirs: site +requires: + CGI: 0 + Config::General: 2.06 + HTML::Parser: 3 + LWP::UserAgent: 0 + Net::IP: 0 + Term::ReadKey: 2 + Time::HiRes: 0 + URI: 0 + +distribution_type: module +generated_by: ExtUtils::MakeMaker version 6.17 diff -r 55210bc792ab -r ddeae812731c Makefile.PL --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/Makefile.PL Thu Mar 18 21:44:14 2004 +0000 @@ -0,0 +1,23 @@ +use 5.00503; +use ExtUtils::MakeMaker; + +WriteMakefile( + NAME => 'W3C::LinkChecker', + ABSTRACT => 'W3C Link Checker', + AUTHOR => 'The W3C Link Checker Team <www-validator@w3.org>', + VERSION_FROM => 'bin/checklink', + PREREQ_PM => { + CGI => 0, + Config::General => 2.06, + HTML::Parser => 3.00, + LWP::UserAgent => 0, + Net::IP => 0, + Term::ReadKey => 2.00, + Time::HiRes => 0, + URI => 0, + }, + EXE_FILES => [ 'bin/checklink' ], + MAN1PODS => { 'bin/checklink.pod' => + '$(INST_MAN1DIR)/checklink.$(MAN1EXT)', + }, +); diff -r 55210bc792ab -r ddeae812731c README --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/README Thu Mar 18 21:44:14 2004 +0000 @@ -0,0 +1,44 @@ +W3C-LinkChecker +=============== + +This distribution contains the W3C Link Checker. + +The link checker can be run as a CGI script in a web server as well as +on the command line. The CGI version provides a HTML interface as +seen at <http://validator.w3.org/checklink>. + +To install the distribution for command line use: + + perl Makefile.PL + make + make test + make install # as root + +To install the CGI version, in addition to the above, copy the +bin/checklink script into a location in your web server from where +execution of CGI scripts is allowed, and make sure that the web server +user has execute permissions to the script. The CGI directory is +typically named "cgi-bin" somewhere under your web server root +directory. 
+ +For more information, please consult the POD documentation in the +checklink.pod file, typically (in the directory where you unpacked the +source): + + perldoc ./bin/checklink.pod + +COPYRIGHT AND LICENCE + +Written by the following people for the W3C: +- Hugo Haas <hugo@w3.org> +- Ville Skyttä <ville.skytta@iki.fi> +- The W3C Link Checker Team <www-validator@w3.org> + +Copyright (C) 1994-2004 World Wide Web Consortium, (Massachusetts +Institute of Technology, European Research Consortium for Informatics +and Mathematics, Keio University). All Rights Reserved. This work is +distributed under the W3C(R) Software License [1] in the hope that it +will be useful, but WITHOUT ANY WARRANTY; without even the implied +warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. + +[1] http://www.w3.org/Consortium/Legal/copyright-software diff -r 55210bc792ab -r ddeae812731c bin/checklink.pod --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/bin/checklink.pod Thu Mar 18 21:44:14 2004 +0000 @@ -0,0 +1,192 @@ +$Id: checklink.pod,v 1.1 2004-03-18 21:44:14 ville Exp $ + +=head1 NAME + +checklink - check the validity of links in an HTML or XHTML document + +=head1 SYNOPSIS + +B<checklink> [ I<options> ] I<URI> ... + +=head1 DESCRIPTION + +This manual page documents briefly the B<checklink> command. + +B<checklink> is a program that reads an HTML or XHTML document, +extracts a list of anchors and lists and checks that no anchor is +defined twice and that all the links are dereferenceable, including +the fragments. It warns about HTTP redirects, including directory +redirects, and can check recursively a part of a web site. + +The program can be used either as a command-line version or as a CGI script. + +=head1 OPTIONS + +This program follow the usual GNU command line syntax, with long options +starting with two dashes (`-'). A summary of options is included below. + +=over 5 + +=item B<-?, --help> + +Show summary of options. 
+ +=item B<-V, --version> + +Output version information. + +=item B<-s, --summary> + +Result summary only. + +=item B<-b, --broken> + +Show only the broken links, not the redirects. + +=item B<-e, --directory> + +Hide directory redirects - e.g. L<http://www.w3.org/TR> -> +L<http://www.w3.org/TR/>. + +=item B<-r, --recursive> + +Check the documents linked from the first one. + +=item B<-D, --depth n> + +Check the documents linked from the first one to depth n +(implies --recursive). + +=item B<-l, --location uri> + +Scope of the documents checked in recursive mode. By default, for +L<http://www.w3.org/TR/html4/Overview.html> for example, it would be +L<http://www.w3.org/TR/html4/>. + +=item B<-n, --noacclanguage> + +Do not send an Accept-Language header. + +=item B<-L, --languages> + +Languages accepted (default: '*'). + +=item B<-q, --quiet> + +No output if no errors are found. + +=item B<-v, --verbose> + +Verbose mode. + +=item B<-i, --indicator> + +Show progress while parsing. + +=item B<-u, --user username> + +Specify a username for authentication. + +=item B<-p, --password password> + +Specify a password. + +=item B<--hide-same-realm> + +Hide 401's that are in the same realm as the document checked. + +=item B<-t, --timeout value> + +Timeout for the HTTP requests. + +=item B<-d, --domain domain> + +Regular expression describing the domain to which the authentication +information will be sent. The default value can be specified in the +checklink configuration file. + +=item B<--masquerade "local remote"> + +Masquerade local dir as a remote URI. For example, the following results in +/my/local/dir/ being "mapped" to http://some/remote/uri/ + + --masquerade "/my/local/dir http://some/remote/uri/" + +As of revision 3.6.2.19 of checklink, --masquerade takes a single +argument consisting of two URIs, separated by whitespace. One usual way of +providing a value with embedded whitespace is to enclose it in quotes. 
+ +=item B<-y, --proxy proxy> + +Specify an HTTP proxy server. + +=item B<-h, --html> + +HTML output. + +=back + +=head1 FILES + +=over 5 + +=item F</etc/w3c/checklink.conf> + +The main configuration file. You can use the L<W3C_CHECKLINK_CFG> environment +variable to override the default location. + +C<Trusted> specifies a regular expression for +matching trusted domains (ie. domains where HTTP basic authentication, if +any, will be sent). For example, the following configures only the w3.org +domain as trusted: + + Trusted = \.w3\.org$ + +C<Allow_Private_IPs> is a boolean flag indicating whether checking links +on non-public IP addresses is allowed. The default is true in command line +mode and false when run as a CGI script. For example, to disallow checking +non-public IP addresses, regardless of the mode, use: + + Allow_Private_IPs = 0 + +=back + +=head1 ENVIRONMENT + +checklink uses the libwww-perl library which has a number of environment +variables affecting its behaviour. See L</"SEE ALSO"> for some +pointers. + +=over 5 + +=item B<W3C_CHECKLINK_CFG> + +If set, overrides the path to the configuration file. + +=back + +=head1 SEE ALSO + +The documentation for this program is available on the web at +L<http://www.w3.org/2000/07/checklink>. + +LWP(3), L<Net::FTP(3)>, L<Net::NNTP(3)>, L<Net::IP>. + +=head1 AUTHOR + +This program was originally written by Hugo Haas <hugo@w3.org>, based on +Renaud Bruyeron's F<checklink.pl>. It has been enhanced by Ville SkyttE<228> +and many other volunteers since. Use the <www-validator@w3.org> mailing list +for feedback, see L<http://validator.w3.org/docs/checklink.html#csb> for more +information. + +This manual page was written by FrE<233>dE<233>ric SchE<252>tz +<schutz@mathgen.ch>, for the Debian GNU/Linux system (but may be used by +others). + +=head1 COPYRIGHT + +This program is licensed under the W3CE<174> Software License, +L<http://www.w3.org/Consortium/Legal/copyright-software>. + +=cut
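The FILES and ENVIRONMENT sections of checklink.pod above describe a configuration file and the W3C_CHECKLINK_CFG override. A minimal sketch of how the two might be combined follows. The temporary file location and the example.org URI are assumptions made for illustration; the directive names and command-line options are the ones documented in the manual page.

```shell
#!/bin/sh
# Sketch only: the file location and the example URI below are assumptions,
# not something shipped with the distribution.
cfg="${TMPDIR:-/tmp}/checklink-example.conf"

# Directives as documented in the FILES section of checklink.pod:
# send HTTP basic authentication only to hosts under w3.org, and refuse
# to check links on non-public IP addresses regardless of the mode.
cat > "$cfg" <<'EOF'
Trusted = \.w3\.org$
Allow_Private_IPs = 0
EOF

# Override the default /etc/w3c/checklink.conf location.
W3C_CHECKLINK_CFG="$cfg"
export W3C_CHECKLINK_CFG

# Run the checker only if it is installed. Per the manual page:
# -b shows only broken links, -e hides directory redirects, and
# -D 2 checks linked documents to depth 2 (which implies --recursive).
if command -v checklink >/dev/null 2>&1; then
    checklink -b -e -D 2 http://www.example.org/ || true
fi
```

Keeping the settings in a separate file and pointing W3C_CHECKLINK_CFG at it leaves the system-wide configuration untouched, which may be convenient when trying out stricter settings for a single run.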
Received on Thursday, 5 August 2010 14:46:55 UTC