- From: <noah_mendelsohn@us.ibm.com>
- Date: Mon, 17 Oct 2005 10:50:19 -0400
- To: "David Orchard" <dorchard@bea.com>
- Cc: www-tag@w3.org
- Message-ID: <OF1C06362F.7E0411D7-ON8525709D.004EF778-8525709D.005182E1@lotus.com>
Dave, Thanks for all the hard work on this. I've only done an initial readthrough, and want to give some more thought before commenting on how this stacks up overall as a TAG finding. At the very least, I think the examples in the middle do an excellent job of explaining the motivations of and tradeoffs faced by web site designers and web services users. As a start, I have a few detailed comments: * You introduce "state" by saying: "What is State: State is the data that pertains to an entity at a particular point in time." I wonder if it would be helpful to go on to say something like: "Most interesting resources have state of one sort or another, which is what allows them to provide interesting information when interacting with user agents on the Web. This finding concerns itself especially with two particular kinds of state that can be problematic: (1) per user or per session state, which can cause a resource to interact differently according to the user making the access or the network connection on which the request is received; and (2) state representing dependent or sub-resources that are not (and arguably should have been) themselves separately identified by a URI...for example, a bank account that is identified by a bank account number stored in a cookie, rather than in a separate URI." As it stands, the finding seems to suggest that all state is problematic, and I don't think that's true; I believe it's these two particular kinds of state that are at issue. * I think it would help to give more motivation and discussion for the "EPRs on the Web" section. Is this something we're recommending? Something that seems tempting but that nobody has a robust way to do? I think a paragraph or two on mapping the QNames of RefParms into hierarchical URIs and/or query parameters (or whatever it is you're recommending) would be helpful. * Possible bug: at one point in the Fabram example you say: "Alternatively, the EPR could insert the CustomerKey in the EPR:" I suspect you meant either "Alternatively, the CustomerKey could be encoded in the [address] property of the EPR" or maybe "Alternatively, service could obtain the CustomerKey from URI", or some such. I don't think an EPR can perform the operation of insertion, and in any case the CustomerKey has been in the EPR all along. * I think you should say: "Some of the key considerations are scalability, reliability, network and application performance, security, and ease of design, >>and promoting network effects on the World Wide Web, I.e. leveraging and contributing to a single, global information space.<<" So, take the above as neutral on whether we should do a finding, what its key points should be, and how close this draft comes as a start. I hope the above suggestions are helpful in any case. Thanks. -------------------------------------- Noah Mendelsohn IBM Corporation One Rogers Street Cambridge, MA 02142 1-617-693-4036 -------------------------------------- "David Orchard" <dorchard@bea.com> Sent by: www-tag-request@w3.org 10/15/2005 02:54 PM To: <www-tag@w3.org> cc: (bcc: Noah Mendelsohn/Cambridge/IBM) Subject: Rough text for State finding I've written up rough text for the state finding primarily for the EPR-47 discussion. If the direction is roughly correct, based on TAG and other feedback, I'll do the conversion to xmlspec, fill in the missing pieces, add refs, etc. for more formal publication. State and Applications This is a draft TAG finding on State. The purpose of the finding is to provide guidance to application developers on the use of Stateful or Stateless applications in a Web context. It examines a variety of designs for a canonical example application to illustrate the complex trade-offs in the designs. It uses HTML browser based and Web service based examples to show the similarities between the design decisions. The finding concludes with an analysis of the architectural property trade-offs between stateful and stateless applications. What is State State is the data that pertains to an entity at a particular point in time. A variety of software entities have state, ranging from applications to operating systems to network layers. The state of an entity changes over time triggered by some kind of event. The event could be a network message, a timer expiring or an application message. Entities that do not have state, that is there is no trigger that causes a transition, are called stateless. Abstract example Dirk decides to build an online banking application. Customers will be able to view their account balances and make transfers. The first step is logging on to the application. When the customer selects accounts view, the banking application will ask them for their username and password. If they have already entered their username and password, they will not be asked for it again. The system will automatically log the customer out if they haven?t done any activity for 10 minutes. We see a prototypical stateful application from the client perspective. The application has 2 states: logged-in and not-logged-in. This state may be realized by storing state on the client or on the server. Decisions The first decision is whether data or state is persisted. It?s either the data used to recreate the state, such as username and password, or it is the state itself. If the data is stored, then it must be stored in the client and then sent to the service for each request. Given a decision of storing the state rather than data used to create the state, another decision is whether the state is stored on the client or the server. Applications where the client stores the state are typically called stateless applications, even though there is state on the client. Given a decision of storing the state on the server, how is the state identified and transmitted by the client. Web applications will typically use URIs for identifying entities aka resources. Where does either the state identifier exist in the message to the server: in the URI, the message body, a particular HTTP header? Example using HTTP Authentication Dirk decides that the banking application will be stateless on the server and the client will resend the data for each request. The application has a URI for the entry page to the banking application and a link to the account balances. When any banking URI is requested, the username/password features of HTTP are used, usually implemented as a pop-up window asking for username and password. There are very few web sites that are built this way, perhaps the largest is the W3C web site. Most web sites use alternative technologies for logging on and they store the state using HTTP cookies or using URL rewriting. The primary reasons for customized security are security concerns, that is wanting greater control over the security timing out, and ease of use concerns, particularly wanting direct control over the look and feel of the screens including helpful tips and links to forgotten passwords. Example using URL Rewriting with client-side state Dirk decides that a customized security screen is needed. A new page with the entry of username and password is inserted in the application, after the ?show item? page in the state flow. Upon successful completion, the URL is rewritten to contain the state that the user has logged on. At first, Dirk was going to have the URL contain the username and password, but that was rejected for obvious security reasons. After the security page, any URLs in pages returned are rewritten to contain the state and the state is encrypted to prevent tampering and guessing. This approach has a significant downside of the URL rewriting. In general, it is unlikely that URLs with a particular users login state need to be exchanged or bookmarked. From a modeling perspective, the resources that would likely be identified are accounts and particular transactions, not login state. Also, it is difficult for the application to have full control over the URL and do the rewriting, it is difficult for the application to parse the URL to extra the state. HTTP Cookies offer the benefit of a well-defined place, the HTTP Cookie header, for storing and retrieving data without rewriting the URL. Example using Cookies with client-side state Nadia decides to change the banking application to store the application state in a cookie. The application still has URIs for the banking application page. The application stores the state in a cookie that is sent to the browser upon successful completion of the page, and sent back to the service on every request. Yet still, very few web applications are built this way. Most secured web sites use cookies where the state is stored on the server, rather than encapsulated in the client. The motivations are primarily about performance, particularly giving the serviced application the control over whether to keep the state in memory or passivate to disk. The state could get quite large or it may be difficult to serialize and so serialization to the client could be difficult. Another motivation is concerns about network performance if the state gets quite large and security of the state. A final motivation is the visibility of the state id Example using Cookies with session ids. Nadia further updates the banking application to store the log-in state in a server side component. The server-side component is identified with an id, commonly called a session id. This session id is stored in the cookie. Stateful resource identifers The previous examples explored the issues and designs related to session identification and transmission. As described in the URL rewriting example, the session information is probably not a stateful resource that requires an identifier. However, a particular user?s account view, particular bank account or particular transaction is intuitively a stateful resource where the identifier could include the particular account or transaction identifier. In the banking application, there are 2 different account balance URI designs: one URI for all users or URI per user. The first design does not have distinct URIs for each of the user account balances. Rather, there is a ?dispatch? URI and the particular user account requested is encoded in the request message or headers. For example, after logging in, the http cookie contains the user id. When the user requests the generic page, the particular user id is sent in the HTTP POST data. The second design has a distinct URI for each of the user ids. The user clicks on the login, and this redirects them to a unique URI for their account.. The URI per account design, sometimes called ?deep-linking?, has all the network effect advantages that the web has to offer: the users account is bookmarkable, exchangeable, etc. It does suffer from potential increased complexity as it may be easier to populate and parse the FORM POST data for the account id rather than the URI. Another problem with clicking on a URL that takes them to say 'cleared checks for my savings account' then if the website is redesigned (a frequent event, at least on the back end) then that URI will break. Either that or the website has to maintain complex mapping tables to handle versioning URIs across multiple versions of the website. Hence many websites would rather just force users to come in through a well defined home page and then focus on making navigation as easy as possible to get them quickly to where they want to be. It is worth noting that the application has 2 different types of state information that are being identified: the account balance and the session id. By putting the account id in the URI and keeping the session id separate, the application has achieved a separation and the different benefits achievable from the transient session information and the network effect of re-usable URIs. Web service example Dirk is tasked with making the banking application available as a Web service rather than HTML pages. He uses XML, SOAP, WSDL, and WS-Addressing to do this. The banking application is a service with an interface containing two operations:log-in and getBalance. The first operation is a log-in operation. If successful, it returns a WS-Addressing ?ReplyTo? containing an EPR that client should use for requesting the account information. The EPR contains a reference parameter that contains the session id and a reference parameter that contains the account id. An example, slightly modified from the WS-Addressing specification is <wsa:ReplyTo> <wsa:EndpointReference xmlns:wsa="http://www.w3.org/2005/08/addressing" xmlns:fabrikam=http://example.com/fabrikam> <wsa:Address>http://example.com/fabrikam/acct</wsa:Address> <wsa:ReferenceParameters> <fabrikam:CustomerKey>123456789</fabrikam:CustomerKey> <fabrikam:SessionID>ABCDEFG</fabrikam:SessionID> </wsa:ReferenceParameters> </wsa:EndpointReference> </wsa:ReplyTo> A request to the service, such as ?GetBalance?, might have a fragment like: <S:Envelope xmlns:S="http://www.w3.org/2003/05/soap-envelope" xmlns:wsa="http://www.w3.org/2005/08/addressing" xmlns:fabrikam="http://example.com/fabrikam"> <S:Header> ... <wsa:To>http://example.com/fabrikam/acct</wsa:To> <wsa:Action>http://example.com/fabrikam/GetBalance</wsa:Action> <fabrikam:CustomerKey wsa:IsReferenceParameter='true'>123456789</fabrikam:CustomerKey> <fabrikam:ShoppingCart wsa:IsReferenceParameter='true'>ABCDEFG</fabrikam:ShoppingCart> ... </S:Header> <S:Body> ... </S:Body> </S:Envelope> Alternatively, the EPR could insert the CustomerKey in the EPR: <wsa:ReplyTo> <wsa:EndpointReference xmlns:wsa="http://www.w3.org/2005/08/addressing" xmlns:fabrikam=http://example.com/fabrikam> <wsa:Address>http://example.com/fabrikam/acct/123456789</wsa:Address> <wsa:ReferenceParameters> <fabrikam:SessionID>ABCDEFG</fabrikam:SessionID> </wsa:ReferenceParameters> </wsa:EndpointReference> </wsa:ReplyTo> EPRs ?on the Web? For the purposes of EndpointReferences-47 discussions, there is no binding of an WSA Message Addressing Properties, including EPRs, into an HTTP request. Some hypothetical instances of the above EPR into an HTTP GET request: GET /fabrikam/acct?CustomerKey=123456789&SessionID=ABCDEFG GET /fabrikam/acct/CustomerKey/123456789?SessionID=ABCDEFG GET /fabrikam/acct/123456789?SessionID=ABCDEFG GET /fabrikam/acct/123456789 Cookie: $Version=?1?; SessionID=?ABCDEFG?; $Path=?/fabrikam? GET /fabrikam/acct/ Cookie: $Version=?1?; SessionID=?ABCDEFG?; CustomerKey=?123456789?; $Path=?/fabrikam? <<Insert WSDL samples? This would show the application structure and messages, but might also overly complicate what is already a moderately lengthy write-up. >> State decision factors The decision on where to place the state in the distributed application and how to identify the state are affected by numerous factors. Some of the key considerations are scalability, reliability, network and application performance, security, and ease of design. Roy Fielding argues in his REST dissertation [1] that stateless server has the benefits of increasing reliability, scalability, visibility and while potentially decreasing network performance. However, I believe the trade-offs from an application developers perspective are somewhat different, and need to be examined from a holistic perspective. Ease of Application construction There are two primary types of designers that are relevant: the network administrator that controls the deployment of applications and publication of URIs, and the application developer that controls the contents of messages including http headers. The application developer can develop the application without affecting the URI with the state id information and so avoid a potential conflict with the administrator. Many, if not most, applications are built to exchange state information that is not ?identifying? information, such as session ids. This is evidenced by the widespread use of HTTP Cookies. In the cases where these applications are also exchanging identifying information, the application development is simpler when the same mechanisms are used for exchanging both types. Examining the Web example, the application developer can easily insert and parse information in the cookie header, rather than rewriting the URI that is sent. In the Web services example, it is very easy to do dispatch based upon a soap header block, which is an XML QName. The tree-like structure of XML and use of SOAP and SOAP Header blocks means that an application developer can use widely available tooling, such as JAX-RPC handler chains, that makes it easy to use the XML QNames. On the converse, there are no standards available for inserting or parsing a QName(s) into or from a URI. It is worth explicitly noting that there is a trade-off between the control over the URI versus other parts of the message body, and a trade-off between the ease of updating/parsing URIs and the other parts of the message body. Scalability Scalability is directly related to the availability of important resources for requested load. The resources can be processes, threads, memory, cpu cycles, database connections, network connections. Allocation and re-use of the resources happens on a per-resources basis. For example, most applications use database connection pooling but they typically gain the functionality from middleware of some kind. The scalability trade-off is whether the cost of acquiring the necessary resources for a request is best served with the state on the client or on the server, and that completely depends upon how the resources are freed up and then re-allocated. In the simplest case, it may be that not freeing up the resources an amount of time is the most scalable. Keeping the state in memory, with a time-out optimized for typical client latency, can scale better than release resources when the time-out is set correctly and the resource acquisition/freeing is significant. Anecodatally, Jim Gray has observed that 5 minutes has been a historically accurate cut-off time for caching in memory rather than persisting to disk. In other configurations, it may be that it is ?cheaper? to free up resources by responding with the session id in the response and persisting the data to the database rather than responding with the entire state to the client because the ?cost? of transmitting to the client is more expensive than the cost of sending to the database. Likewise, it may be ?cheaper? to reify the session by acquiring it from the database than from the client. However, the cost of freeing up state and recovering state is based on a variety of factors, specifically the system architecture and the connections to the client and database, the middleware software, the database software, and hardware/software platforms used for the system. Reliability Reliability of Stateful applications has two distinct aspects: reliability of the machines and reliability of the network. The reliability of the network is not typically a factor in the design of the application style, as it is typically assumed that the network is unreliable. The aspect of reliability that concerns this writing is machine reliability. For a given client, the two time periods of interest are during a request and in between requests. In a stateless application design, a machine can fail between a request without affecting the clients view of the system. They send a request and it is dispatched to an available machine. If a machine has crashed in between requests, there is no disruption. In a stateful application design, the systems can be designed to handle failures between a request. Common techniques are duplicating the state ? RAID disks, back-up nodes ? and hardening the system ? UPS, memory checksums, etc. For example, an application server can have a primary and backup node. If a machine fails, then the backup node is used for subsequent requests. Stateful and stateless application design must deal with the situation of where a machine crashes during a request. In stateless applications, typically the request is lost. Let us make a simplifying assumption that the request is ?atomic? and is either completed or aborted. This allows us to avoid the problem of determining application state where the problems of meaning of reliability in a synchronous environment arise. Many systems are designed to handle machine failure during processing by having a stateful ?dispatcher? that has tracked the request and can replay the request to a different machine if one fails during a request. Related to Reliability is manageability, as systems are often managed for reliability. For example, a component may be starting to behave erratically and the administrator wishes to replace the component. Stateful and stateless systems would probably be designed for this task by letting the requests ?drain? off of the system that is due for maintenance. A stateful system has the downside that the states may be long-running and hence take longer to ?drain?. Advances in application server technology provide for managing these by supporting ?transferring? state from one machine to another. The discussions so far have not discussed ?client? reliability. Web based systems typically have a simplifying assumption that browsers are for a single user, are unreliable, and responses must be received within about 30 seconds for good HMI design. We have made a simplifying, but erroneous assumption that systems where the client has the state are ?stateless?. The state always exists somewhere, and in many EAI and B2B systems, the web based simplifying assumptions are not true. The system, whether it is the client or the server or the network, must contain the state. Imagine a system where the client keeps the state for long periods of time, it must deal with reliability of the state information. If a machine crashes, the system can?t lose the state. Stateful systems can deliver virtually the same reliability as stateless systems, it is more appropriate described as a matter of cost. A stateful system may require more costly infrastructure in the form of components selected to achieve the same reliability. OTOH, the difference between the reliability for a stateless system versus a stateful system may be small given the overall reliability desired. <<TBD: Improve text to hit the point that many systems are not like the human-centric web and ?stateless? is a complete misnomer>> Network Performance <<TBD>>
Received on Monday, 17 October 2005 14:50:57 UTC