[Editorial Draft] State in Web application design

1 Introduction

This is a draft TAG finding on State. The purpose of the finding is to provide guidance to application developers on the use of Stateful or Stateless applications in a Web context. It examines a variety of designs for a canonical example application to illustrate the complex trade-offs in the designs. It uses HTML browser based and Web service based examples to show the similarities between the design decisions. The finding concludes with an analysis of the architectural property trade-offs between stateful and stateless applications.

2 What is State

State is the data that pertains to an entity at a particular point in time. A variety of software entities have state, ranging from applications to operating systems to network layers. The state of an entity changes over time triggered by some kind of event. The event could be a network message, a timer expiring or an application message. Entities that do not have state, that is there is no trigger that causes a transition, are called stateless. Most interesting resources have state of one sort or another, which is what allows them to provide interesting information when interacting with user agents on the Web. This finding concerns itself especially with the following kinds of state:

Application state, which broadly is the state of a particular application;
resource state, which is generally the state related to a resource identified with a URI. In the Web context, typically application state will align with resource state, that is the state of the resources is the state of the application. One aspect of resource state is sub-resource state, that is resources that do not (and arguably should have been) themselves separately identified by a URI...for example, a bank account that is identified by a bank account number stored in a cookie, rather than in a separate URI;
per user or per session state, which can cause a resource to interact differently according to the user making the access or the network connection on which the request is received;

This finding starts from the perspective that truly stateless systems are uninteresting. Further, most systems that are advertised as stateless are actually incorrect characterized or they are doing state management someplace else in the system. The real questions of system design with respect to state is where and how the various types of state will be managed, and who has read/write access to the state. The decisions for state management are taken in the broader context of overall system design. This finding will look into how state, such as authentication information and application state, is exchanged in a Web and Web services context. It will examine the trade-off in properties between simplicity in server design versus simplicity in component design, and explore additional properties such as scalability, reliability, and performance.

3 State in applications

The state in an application may exist across a large variety of applications. A prototypical Web application will have a web browser communicating with a Web server. The Web server is the first point of contact for the application, which could consist static HTML pages, PHP generated pages from a mySQL database, a Java application running in an application server communicating with a high-end SQL database, and many more configurations. Despite many years of best efforts to insulate or abstract application design from the underlying components, it is a reality that the allocation of state to the components and the selection of components are directly coupled.

3.1 Browser State

The Web browser is one half of the design of state in a Web application, and it has specific state items that it can manage. It is important for our analysis to state that a client that stores data related to an application, such as username/passwords for URIs, is considered a stateful application. Web browsers are stateful clients because they store state, despite being typically mislabeled as "stateless". It is even more useful to analyze the type of state that browsers store and manage. Typically the web browser state is roughly related to the amount of information that the user has typed in or configured, which has natural limits. We find that generally the state managed by the web browser (aka web browser state) is not what one would consider large in today's computing terms. For example, web browsers are not storing multiple megabytes of data (like a digitized signature) per URI. There are additional state aspects of browser interactions that may be stored. There is a modestly popular firefox extension that will store and allow reload of the entire browser session, such as all the open windows. The pages being viewed is another application specific type of state that can be stored and managed. An incomplete list of state managed by web browsers is: cached pages; username/password per realm; cookies; form auto-complete data; previously viewed pages (history); currently viewed pages; home page; configuration settings such as text sizes, colors, fonts, etc.. This is a fairly extensive list of state being managed.

In addition to state that the application directly understands and manages, there is state that may be managed by the browsers but that is evaluated or interpreted at the server. A primary example of this is HTTP cookies. The state inside a cookie, including the variation where a session id is in the cookie, is opaque to the browser. It is the server application that has read/write access to the state.

Probably the primary entity that manages state is a server, from ftp to web to application server. [FOLDOC] provides a useful and interesting definition of stateful versus stateless servers. "A stateless server is one which treats each request as an independent transaction, unrelated to any previous request. This simplifies the server design because it does not need to allocate storage to deal with conversations in progress or worry about freeing it if a client dies in mid-transaction. A disadvantage is that it may be necessary to include more information in each request and this extra information will need to be interpreted by the server each time. An example of a stateless server is a World-Wide Web server. These take in requests (URLs) which completely specify the required document and do not require any context or memory of previous requests. Contrast this with a traditional FTP server which conducts an interactive session with the user. A request to the server for a file can assume that the user has been authenticated and that the current directory and transfer mode have been set.". This definition, while true, is very easily open to misinterpretation. Many of the Web applications that are built on top of a Web server are very stateful. They may have log-on sessions, application sessions like shopping carts, etc. that all use context or memory from previous requests. The design decision of a stateful application over a stateless connection (aka Cookies with Session IDs or EndpointReferences with Reference Parameters) versus a stateful application over a stateful connection (aka FTP) is a very detailed decision, and not nearly as obvious as it might seem.

This definition, and the subtleties contained therein, touch on the heart of this finding, which is that design state in systems is part of a complex task that requires detailed analysis.

4 Decisions

We now quickly go through the various decisions that are made in application design.

Decision

1. Is there application state

Decision

2. Where is application state stored?

Decision

3. Is there session state?

Decision

4. If no session state, does the client store any data, ie is it stateful?

Decision

5. If there is session state, does the client store the session state or an identifier for the session state that is stored on the service.

It is possible to never store the data on the client or the server. However, we almost never choose completely stateless clients and servers. In a browser based application, it would be very frustrating to enter username and password for every request. This is why the browser stores username/passwords, as well as cookies. Realistically, the choice is whether to store the data used to create the state, such as username and password, on the client or whether there is session state, such as "logged-in".

If the client is stateful (yes to question #4), then the data that is stored in the client is sent to the service for each request. If the decision is to store the state ( rather than data used to create the state), the next decision (#5) is whether the state is stored on the client or the server. Note that applications where the client stores data, beit data for recreating state or the state itself, are typically (and erroneously) called stateless applications, even though there is state on the client.

Decision

6. If session state stored on the server, how does client send the state identifier?

For Web messages, the location of the state id can only be in the URI, in an HTTP header or in the message body. We exclude the possibility of expressing state in the protocol operation or method.

A number of examples will show a variety of decision combinations for Web, XML and Web service technologies.

5 HTML examples

This section introduces an abstract example and various web browser designs to implement the abstract example.

5.1 Abstract example

Dirk decides to build an online banking application. Customers will be able to view their account balances and make transfers. The first step is logging on to the application. When the customer selects accounts view, the banking application will ask them for their username and password. If they have already entered their username and password, they will not be asked for it again. The system will automatically log the customer out if they have been idle for 10 minutes.

We see a prototypical stateful application from the client perspective. This example introduces an application that involves two types of state: a session state for logging in and persistent application state for the account. The session state may be realized by storing state on the client or on the server. The example is considered abstract because it is independent of the underlying technology choices, such as HTTP, Cookies, SOAP, WS-Addressing, etc. The example will be elaborated in HTML Browser and XML based interactions.

The bank account application clearly has account balance state and it is stored in the server, though clients may have a local copy of the state. The design of the authentication interaction requires a number of decisions related to session state.

5.2 Example using HTTP Authentication

Dirk decides that the banking application will maintain no session state on the server and the client will send any necessary data for each request. The application has a URI for the entry page to the banking application and a link to the account balances. When any banking URI is requested, the username/password features of HTTP are used, usually implemented as a pop-up window asking for username and password.

Example 1: HTTP Authentication

GET /acct/123456789  HTTP/1.1
Host: www.fabrikam.com
user-pass: username:password

Note: user-pass is shown for relevence and convenience and the actual HTTP Authorization header would be different than shown above.

There are very few web sites that are built in such a stateless server manner, perhaps the largest is the W3C web site. Most web sites use alternative technologies for logging in and they store the state using HTTP cookies or using URL rewriting. The primary reasons for customized security are ease of use concerns, particularly wanting direct control over the look and feel of the screens including helpful tips and links to forgotten passwords. It is not clear whether this helps or hinders security concerns, that is wanting greater control over the security timing out.

5.3 Example using URL Rewriting with session in the URL

Dirk decides that a customized security screen is needed. A new page with the entry of username and password is inserted in the application, after the "show item" page in the state flow. At first, Dirk was going to have the URL contain the username and password. From a state analysis perspective, there is no significant difference between storing a username/password per URL or a URL that contains a username/password.

Example 2: URI rewriting with username/password

GET /acct/123456789?user=username&pwd=password  HTTP/1.1
Host: www.fabrikam.com

A key difference is security, so the URL containing username/password was rejected for obvious security reasons. Upon successful completion, the URL is rewritten to contain the state that the user has logged on. After the security page, any URLs in pages returned are rewritten to contain the state and the state is encrypted to prevent tampering and guessing. Dirk has quickly moved into a decision that the application will have session state.

Example 3: URI rewriting with session id

GET /acct/123456789?sessionid=5
Host: www.fabrikam.com

The URL with session id approach has a significant downside as it is unlikely that URLs with a particular users login state need to be exchanged or bookmarked. From a modeling perspective, the resources that would likely be identified are accounts and particular transactions, not login state. If the user ever forwarded or bookmarked the URL, the login state will be useless at best and confusing and inefficient at worst. Also, it may be difficult for the application to have full control over the URL and do the rewriting, and it may be difficult for the application to parse the URL to extract the state. Stepping back a bit, the issue is that the application state (the account) and the session state( login state) may need to be independent for a variety of reasons.

5.4 Example using Cookies with client-side state

HTTP Cookies offer the benefit of a well-defined place, the HTTP Cookie header, for storing and retrieving data without rewriting the URL.

Nadia decides to change the banking application to store the application state in a cookie. The application still has URIs for the banking application page. The application stores the state in a cookie that is sent to the browser upon successful completion of the page, and sent back to the service on every request.

Example 4: Cookie with client side state

GET /acct/123456789  HTTP/1.1
Host: www.fabrikam.com
Cookie: $Version="1"; user="username"; pass="password"

Yet still, very few web applications are built this way. Most secured web sites use cookies where the state is stored on the server, rather than encapsulated in the client. The motivations are primarily about security, particularly giving the serviced application the control over whether to keep the state in memory or passivate to disk. Again, storing the state in a URL, in an HTTP Cookie (which is a special HTTP header) or in an HTTP Authentication special memory area all make the client stateful. In general application design, there are other concerns around client-side state as the state could get quite large so the constraints on the client storage could be onerous, it may be difficult to serialize and so serialization to the client could be difficult, or the network performance could be significant.

5.5 Example using Cookies with session ids

Nadia further updates the banking application to store the log-in state in a server side component. The server-side component is identified with an id, commonly called a session id. This session id is stored in the cookie.

Example 5: Cookie with client side session id

GET /acct/123456789 HTTP/1.1
Host: www.fabrikam.com
Cookie: $Version="1"; sessionid="5"

5.6 Stateful resource identifiers

The previous examples explored the issues and designs related to session identification and transmission. As described in the URL rewriting example, the session information is probably not a stateful resource that requires an identifier. However, a particular user's account view, particular bank account or particular transaction is intuitively a stateful resource where the identifier could include the particular account or transaction identifier.

In the banking application with account state - moving on from the log-in state - there are 2 different account balance URI designs: one URI for all users or URI per user. The first design does not have distinct URIs for each of the user account balances. Rather, there is a "dispatch" URI and the particular user account requested is encoded in the request message or headers. For example, after logging in, the http cookie contains the user id. When the user requests the generic page, the particular user id is sent in the HTTP POST data.

The second design has a distinct URI for each of the user ids. The user clicks on the login, and this redirects them to a unique URI for their account. The URI per account design, sometimes called "deep-linking", has all the network effect advantages that the web has to offer: the users account is bookmarkable, exchangeable, etc.

Example 6: Two URI designs

GET /acct/  HTTP/1.1
Host: www.fabrikam.com
Cookie: $Version="1"; acct="123456789"

GET /acct/123456789  HTTP/1.1
Host: www.fabrikam.com

It does suffer from potential increased complexity as it may be easier to populate and parse the data from someplace other than the URI, such as FORM POST or cookie data. Another problem with selecting a URI that takes them to say 'cleared checks for my savings account' then if the website is redesigned (a frequent event, at least on the back end) then that URI will break. Either that or the website has to maintain complex mapping tables to handle versioning URIs across multiple versions of the website. Hence many websites would rather just force users to come in through a well defined home page and then focus on making navigation as easy as possible to get them quickly to where they want to be.

It is worth noting that the application has 2 different types of state information that are being identified: the account balance and the session state. By putting the account id in the URI and keeping the session id separate, the application has achieved a separation and the different benefits achievable from the transient session information and the network effect of re-usable URIs.

6 XML interactions

The previous examples showed how browser based technologies support stateful clients and session based interactions. There are also the same issues in XML interactions.

6.1 XML example

Dirk is tasked with making the banking application available as a Web service rather than HTML pages. He uses XML to do this. All the possibilities shown in the previous HTML example are available for returning XML. For example, he could use HTTP Authentication to return an XML document with the balance.

Example 7: HTTP Authentication with XML

GET /acct/123456789 HTTP/1.1
Host: www.fabrikam.com
user-pass: username:password

returns:
<Balance acct="123456789">2000</Balance>

However, XML gives him the option to also send XML in the HTTP Body. He steps back and decides that the banking application is a service with an interface containing two operations:log-in and getBalance. The first operation is a log-in operation. If successful, it returns an XML document containing a session ID that client should use for requesting the account information.

TBD need WSDL. Need more motivation on this part, related to WSDL.

Example 8: An example login request and response

POST /acct/123456789 HTTP/1.1
Host: www.fabrikam.com
<login>
	<username>foo</username>
	<password>bar</password>
</login>

response:
<loginResponse>
	<status>OK</status>
	<sessionId>5</sessionId>
</loginResponse>

A request to the service, such as "GetBalance", might have a fragment like:

Example 9: GetBalance fragment

POST /acct/123456789 HTTP/1.1
Host: www.fabrikam.com
<Request>
	<Action>GetBalance<Action>
	<sessionId>5</sessionId>
</Request>

returns:
<Balance acct="123456789">2000</Balance>

Dirk must know the "choreography" of these interactions, particularly that the sessionId in the loginResponse is used for a following GetBalance request.

Alternatively, the session Id could be represented as an HTTP Content-Location header

Example 10: An example login fragment response with session id in an HTTP Header

Content-Location: http://www.fabrikam.com/acct/12345689/?sessionId=5
					
<loginResponse>
	<status>OK</status>
</loginResponse>

Followed by
GET /acct/123456789?sessionId HTTP/1.1
Host: www.fabrikam.com

TBD. Lots of notes on this design.

6.2 Web service example

Dirk is tasked with making the banking application available using SOAP and WS-Addressing technologies in addition to XML and WSDL. The first operation is a log-in operation. If successful, it returns a WS-Addressing "ReplyTo" containing an EPR that client should use for requesting the account information. The EPR contains a reference parameter that contains the session id and a reference parameter that contains the account id.

Example 11: An example, slightly modified from the WS-Addressing specification

<S:Envelope xmlns:S="http://www.w3.org/2003/05/soap-envelope"
         xmlns:wsa="http://www.w3.org/2005/08/addressing"
         xmlns:fabrikam="http://example.com/fabrikam">
   <S:Header>
     ...
    <wsa:To>http://example.com/fabrikam/acct</wsa:To>
    <wsa:Action>http://example.com/fabrikam/login</wsa:Action>
     ...
   </S:Header>
   <S:Body><login><user>username</user><pass>password</pass></login>
   </S:Body>
</S:Envelope>

Returns:
<S:Envelope xmlns:S="http://www.w3.org/2003/05/soap-envelope"
         xmlns:wsa="http://www.w3.org/2005/08/addressing"
         xmlns:fabrikam="http://example.com/fabrikam">
   <S:Header>
       <wsa:Action>http://example.com/fabrikam/loginResponse</wsa:Action>
	<wsa:ReplyTo>
	  <wsa:EndpointReference
     		xmlns:wsa="http://www.w3.org/2005/08/addressing"
		xmlns:fabrikam=http://example.com/fabrikam>
   		<wsa:Address>http://example.com/fabrikam/acct</wsa:Address>       
   		<wsa:ReferenceParameters>
    			<fabrikam:CustomerKey>123456789</fabrikam:CustomerKey>
    			<fabrikam:SessionID>5</fabrikam:SessionID>
   		</wsa:ReferenceParameters>
  	</wsa:EndpointReference>
	</wsa:ReplyTo>
</S:Header>
<S:Body>
	<response>OK</response>
</S:Body>

A request to the service, such as "GetBalance", might have a fragment like:

Example 12: GetBalance fragment

<S:Envelope xmlns:S="http://www.w3.org/2003/05/soap-envelope"
         xmlns:wsa="http://www.w3.org/2005/08/addressing"
         xmlns:fabrikam="http://example.com/fabrikam">
   <S:Header>
     ...
    <wsa:To>http://example.com/fabrikam/acct</wsa:To>
    <wsa:Action>http://example.com/fabrikam/GetBalance</wsa:Action>
    <fabrikam:CustomerKey wsa:IsReferenceParameter='true'>123456789</fabrikam:CustomerKey>
    <fabrikam:ShoppingCart wsa:IsReferenceParameter='true'>5</fabrikam:ShoppingCart>
     ...
   </S:Header>
   <S:Body/>
</S:Envelope>

Alternatively, the CustomerKey could be encoded in the address property in the EPR

Example 13: CustomerKey in the EPR

<wsa:ReplyTo>
  <wsa:EndpointReference
     xmlns:wsa="http://www.w3.org/2005/08/addressing"
     xmlns:fabrikam=http://example.com/fabrikam>
   <wsa:Address>http://example.com/fabrikam/acct/123456789</wsa:Address>       
   <wsa:ReferenceParameters>
    <fabrikam:SessionID>5</fabrikam:SessionID>
   </wsa:ReferenceParameters>
  </wsa:EndpointReference>
</wsa:ReplyTo>

6.3 EPRs "on the Web"

The WS-Addressing specifications do not provide a binding or mapping of WS-Addressing Message Addressing Properties (MAPs), including EPRs, into an HTTP request. Further, there does not appear to be any industry standard for such a binding. Without such a binding, most if not all EPRs that are created with Reference Parameters will not be available on the Web. All the example URIs listed in the Web section could be used for application to application communication.

6.4 Web services authentication state

WS-Security specifies a SOAP header block for securing SOAP messages. One form of WS-Security is the username/password, as specified in the username token profile.

Example 14: WS-Security username/password example.

<S:Envelope xmlns:S="http://www.w3.org/2003/05/soap-envelope"
         xmlns:wsa="http://www.w3.org/2005/08/addressing"
         xmlns:wsse="http://docs.oasis-open.org/wss/2004/01/oasis-200401-wss-wssecurity-secext-1.0.xsd"
         xmlns:fabrikam="http://example.com/fabrikam">
   <S:Header>
     ...
    <wsa:To>http://example.com/fabrikam/acct</wsa:To>
    <wsa:Action>http://example.com/fabrikam/GetBalance</wsa:Action>
    <wsse:Security>
    	<wsse:UsernameToken>
    		<wsse:Username>username</wsse:Username>
    		<wsse:Password>password</wsse:Password>
    	</wsse:UsernameToken>
    </wsse:Security>	
     ...
   </S:Header>
   <S:Body>
   	<Account>123456789</Account>
   </S:Body.
</S:Envelope>

6.5 Web services session

WS-SecureConversation specifies a security token that represents a secureconversation. The context is negotiated prior to the application request.

Example 15: WS-SecureConversation example.

<S:Envelope xmlns:S="http://www.w3.org/2003/05/soap-envelope"
         xmlns:wsa="http://www.w3.org/2005/08/addressing"
         xmlns:wsse="http://docs.oasis-open.org/wss/2004/01/oasis-200401-wss-wssecurity-secext-1.0.xsd"
        xmlns:wsc="http://schemas.xmlsoap.org/ws/2005/02/sc"      
         xmlns:fabrikam="http://example.com/fabrikam">
   <S:Header>
     ...
    <wsa:To>http://example.com/fabrikam/acct</wsa:To>
    <wsa:Action>http://example.com/fabrikam/GetBalance</wsa:Action>
    <wsse:Security>
    	<wsc:SecurityContextToken>
		<wsc:Identifier>uuid:...</wsc:Identifier>
    	</wsc:SecurityContextToken>
    </wsse:Security>	
     ...
   </S:Header>
   <S:Body>
   	<Account>123456789</Account>
   </S:Body.
</S:Envelope>

7 State decision factors

The decision on where to place the state in the distributed application and how to identify the state are affected by numerous factors. Some of the key considerations are scalability, reliability, network and application performance, security, ease of design and promoting network effects on the World Wide Web, I.e. leveraging and contributing to a single, global information space

Roy Fielding argues in his REST dissertation [REST] that stateless server has the benefits of increasing reliability, scalability, visibility and while potentially decreasing network performance. However, I believe the trade-offs from an application developers perspective are somewhat different, and need to be examined from a holistic perspective.

8 Conclusion

This Finding describes a number of questions, decisions and rules for using state in Web and Web service architecture and design. The main goal of the finding is to describe the choices facing developers with a describing of the properties of interest and some of their trade-offs

9 References

FOLDOC: Free Online Dictionary of Computing (See http://foldoc.doc.ic.ac.uk/foldoc/foldoc.cgi?query=stateless&action=Search.)
henryepr: Henry's EPR example. (See http://lists.w3.org/Archives/Public/www-tag/2005Nov/0008.html.)
REST: REST dissertation, Dr. Roy Fielding (See http://www.ics.uci.edu/~fielding/pubs/dissertation/net_arch_styles.htm#sec_3_4 .)
tagnovf2fdisc: TAG Nov f2f EPR discussion. (See http://www.w3.org/2001/tag/2005/12/06-Afternoon-minutes.html#item05.)
HTTP: RFC 2616, HTTP. (See http://www.ietf.org/rfc/rfc2616.txt.)
HTTPAuth: RFC 2617, HTTP Authenticaiton. (See http://www.ietf.org/rfc/rfc2617.txt.)
SOAP 1.2: W3C Recommendation, SOAP 1.2 Part 1: Messaging Framework (See http://www.w3.org/TR/SOAP/.)
WSDL 1.1: W3C Note, WSDL 1.1 (See http://www.w3.org/TR/WSDL/.)
XML 1.0: W3C Recommendation, XML 1.0 (See http://www.w3.org/TR/REC-xml.)
XML Namespaces: W3C Recommendation, XML Namespaces (See http://www.w3.org/TR/REC-xml-names.)
WSSecurity: WS-Security (See http://docs.oasis-open.org/wss/2004/01/oasis-200401-wss-soap-message-security-1.0.pdf.)
WSSecurityTokenProfile: WS-Security Username Token Profile (See http://docs.oasis-open.org/wss/2004/01/oasis-200401-wss-username-token-profile-1.0.pdf.)
WSSecureConversation: WS-SecureConversation (See http://specs.xmlsoap.org/ws/2005/02/sc/WS-SecureConversation.pdf.)

10 Acknowledgements

The author thanks the many reviewers that have contributed to this document, in no particular order: Mark Baker, Dan Connolly, Alexander Macaulay, Noah Mendelssohn, Gilbert Pilz.

[Editorial Draft] State in Web application design

Draft TAG Finding 15 February 2006

Abstract

Status of this Document

Table of Contents

1 Introduction

2 What is State

3 State in applications

3.1 Browser State

3.2 Server State

4 Decisions

5 HTML examples

5.1 Abstract example

5.2 Example using HTTP Authentication

5.3 Example using URL Rewriting with session in the URL

5.4 Example using Cookies with client-side state

5.5 Example using Cookies with session ids

5.6 Stateful resource identifiers

6 XML interactions

6.1 XML example

6.2 Web service example

6.3 EPRs "on the Web"

6.4 Web services authentication state

6.5 Web services session

7 State decision factors

7.1 Ease of Application construction

7.2 Security

7.3 Scalability

7.4 Performance

7.5 Reliability

7.6 Network Performance

8 Conclusion

9 References

10 Acknowledgements