Link on page 'http://www.libertex.com/site/links.html':
news story
SOAP broken link fire-and-forget (single recipient) message
http://www.wibble.com/site/links.html
http://news.bbc.co.uk/story883488.stm
404
404 - File Not Found
134
ReportedCheckedOK
datetime
1 Browser sends HTTP GET for URI to Webserver 1
2 Webserver 1 sends HTML page
3 User follows link
3.1 User clicks link pointing at Webserver 2
3.2 HTTP 404 response - target not found
3.3 Display 404 page to user as usual
4 Browser sends SOAP message (above) to Webserver 1
Possible link status codes:
Link unchecked
Checked, ok,
Checked, broken,
Reported broken, unchecked
Reported broken, Checked, ok,
Reported broken, Checked, broken,
NB 1. SOAP messages are only sent if link status is either Unchecked or CheckedOK, and if the link response is 404
2. Once message is received, it is expected that the webserver marks the link with ReportedBrokenUnchecked, to prevent other
browsers from reporting it (could result in DDOS-like requests!)
3. May want some sort of indication as to what protocol/XMLNS for broken links the server supports
4. May want to enable server to automate the initial checking
-
Components required
1) XSD definition for this SOAP message
2) Firefox plugin for browser to detect 404's and send SOAP messages (to sites that support it)
3) Servlet to receive broken links and display broken links' info
Optional
1) Tomcat Filter to retrofit existing website with support for broken link highlighting
2) Java script client version (with rewrite support in above filter)
3) Way to indicate what broken link support is available, and how to tell if links are broken (pattern match or class=...