- From: Henry Story <henry.story@bblfish.net>
- Date: Wed, 23 Jun 2004 14:05:02 +0200
- To: ietf-http-wg@w3.org
SUMMARY: There is currently an asymmetry in reporting errors between the client and the server. The server can return an error status to the client on its request, but the client cannot tell the server that it has returned an invalid response. This proposal rectifies this problem with a RESTful easily implementable and backward compatible solution to this asymmetry, by proposing a new ERR (ERROR) HTTP method to complement GET, POST, PUT, ... BACKGROUND: The W3C is requiring strict adherence to many new standards. XML for example has to be well formed, and should be rejected if not. The well formedness of an XML response depends on the XML payload as well as the HTTP headers (such as mime types) that accompany the response. If these are broken, as can happen all to easily when a web server is improperly configured, the client has no simple and automatic way of notifying the resource that it is broken. For B2B applications this is not too much of an issue, as a lot of resources and many channels are available between the consumer of a resource and its producer. B2B has up till now been the main consumer of XML. In the consumer world the dynamics are very different, and will lead to a widening gap between specification and implementation. This is why this issue has appeared on the Atom mailing list[2]. But I believe the proposed solution to that problem can be generalised in such a way as to help the forces of standardisation across the whole web. PROPOSAL: Note this is a fledgling proposal, and will clearly need some growing up. When a client receives a malformed server response it CAN (SHOULD?) notify the resource that it is broken, by sending a ERR request, identical in all ways except for the ERR method to the original request, plus a couple of extra ERR specific headers: -Error-Message: a human readable standard error message -Error-Code: A set of to be defined error codes that categorise the type of error -Error-Spec: A pointer to RFC document sections that explain the error -Error-Date: the date the request was initially sent -Error-Method: the method (GET, POST, ...) of the original request. -Error-ContentLength: the length of the human readable error text that could be the body of this message ERR should probably be limited to certain specific types of errors, including things like broken XML, XML encoding incorrectly specified in the header, or other errors relating to well known RFC or specs. This is to be fleshed out... EXAMPLE: -------8<------- GET /index.xml HTTP/1.x Content-encoding: text/xml; charset=UTF-8 Accept: */* Accept-Encoding: gzip, deflate;q=1.0, identity;q=0.5, *;q=0 Accept-Language: en-us, ja;q=0.62, de-de;q=0.93, de; ... <?xml version="1.0" encoding="iso-8859-1" ?> <pløtz/> ------->8------- The response is broken though clearly interpretable. Clients (in the wider of Consumer2C or B2C) will therefore attempt to accommodate the standards due to market pressure. Market pressures are close to physical laws in their ferocity. We cannot change them. As a result more an more such breakages will occur, and the standards will be left in the dust of this vicious whirlwind.[1] In any case fighting against it is going to be very tiresome. Much easier is to require clients to at least send an ERR response to the resources if they are going to bypass the standards. If you allow us to imagine a future where resources are intelligent enough to fix themselves, we can see how this can help the web heal itself, automatically. Here is an example of the clients message: -------8<------- ERR /index.xml HTTP/1.x Content-encoding: text/xml; charset=UTF-8 Accept: */* Accept-Encoding: gzip, deflate;q=1.0, identity;q=0.5, *;q=0 Accept-Language: en-us, ja;q=0.62, de-de;q=0.93, de; Error-Message: XML is of incorrect content type Error-Code: XXXX Error-Spec: RFCXYZ,sec 3; RFCXXX, sec54 Error-Date: Saturday 19 June 2004, 18:05:30 GMT (whatever encoding) Error-Method: GET Error-ContentLength: 63 The Mime type of the content was text/xml. This requires the content to be in ASCII format, but we found some UTF-8 characters in the message. We could interpret the message at present but will not necessarily be able to do so in the future. Please refer to RFCXYZ, sec 3 and RFCXXX, sec54 for more information. These can be found at http://ietf.org/ ------->8------- ADVANTAGES: 1. RESTfulness Proxies and other intermediaries can join in to make the Web a more standard place. 2. Backward compatible This proposal could very well already work with the current web architecture, without any problem. I have tried it myself: -------8<------- hjs@bblfish:0$ telnet localhost 80 Trying 127.0.0.1... Connected to localhost. Escape character is '^]'. ERR /index.html HTTP/1.1 Host: bblfish.localhost Message: invalid XML HTTP/1.1 501 Method Not Implemented Date: Sat, 19 Jun 2004 10:10:37 GMT Server: Apache/1.3.29 (Darwin) Vary: accept-language,accept-charset Allow: GET, HEAD, OPTIONS, TRACE Connection: close Transfer-Encoding: chunked Content-Type: text/html; charset=iso-8859-1 14c <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN"> <HTML><HEAD> <TITLE>501 Method Not Implemented</TITLE> </HEAD><BODY> <H1>Method Not Implemented</H1> ERROR to /index.html not supported.<P> Invalid method in request ERROR /index.html HTTP/1.1<P> <HR> <ADDRESS>Apache/1.3.29 Server at bblfish.local Port 80</ADDRESS> </BODY></HTML> ------->8------- Clearly this is not the response we want in a web that has adopted this proposal, but it already has the correct side effect: namely it adds an error message in my apache error log: -------8<------- [Sat Jun 19 12:10:45 2004] [error] [client 127.0.0.1] Invalid method in request ERROR /index.html HTTP/1.1 ------->8------- Apart from that of course it correctly informs the client that the ERR message is not available, and so that sending further such requests is pointless. 3. This proposal avoids the vicious circle that other workarounds require: namely a file somewhere that specifies where to send error reports, this file itself perhaps being malformed. 4. The resource to which ERR is sent is known to be alive, since it just responded to the request. The ERR can furthermore be sent as part of the same tcp connection. REFERENCES This came out of a discussion on the atom mailing list. [1] originally proposed here: http://www.imc.org/atom-syntax/mail-archive/msg05112.html [2] a concise explanation for the need for the ERR method: http://www.imc.org/atom-syntax/mail-archive/msg05146.html [3] a long discussion on #rdfig where I try to respond to all the questions thrown at me by Danny Ayers http://www.ilrt.bris.ac.uk/discovery/chatlogs/rdfig/2004-06 -19.html#T14-48-59 [4] a Page on the Atom wiki that may be kept up to date on this issue: http://www.intertwingly.net/wiki/pie/PaceErrVerb
Received on Wednesday, 23 June 2004 08:05:10 UTC