- From: Mark Nottingham <mark.nottingham@bea.com>
- Date: Mon, 15 Nov 2004 20:56:52 -0800
- To: Francisco Curbera <curbera@us.ibm.com>
- Cc: David Orchard <dorchard@bea.com>, public-ws-addressing@w3.org
Hi Paco, Could you develop this into a proposal, expressed in terms of what you'd change in the specification? Thanks, On Nov 13, 2004, at 9:34 AM, Francisco Curbera wrote: > > I appreciate very much Dave's proposal and the advantage of having his > Web > architecture expertise to help us navigate these murky waters. However, > before we approach the TAG or do anything drastic like that, I think I > need > to explain my qualms about the way we seem to be approaching this > issue; I > have the feeling I am in a minority in this, but I fear an important > perspective may be lost in this discussion. > > First, we need to admit that the specification contains specific > language > to prompt this concern and the current resolution approach. In > particular, > Section 2.1 in the core document draft talks about using reference > properties to "identify" the entity being conveyed. However, I claim > that > this language is misleading and not consistent with the spirit of the > spec; > moreover, if we take it to its final conclusion important use cases > will > not be appropriately supported. > > My central point is that the idea that EPRs identify service endpoints > is > wrong and potentially dangerous. There is a key different between > identifiers and addresses. The Web architecture document correctly > states > that different identifiers (URIs) should not be associated with the > same > resource; while aliases are not prohibited, they do undermine the > usefulness of URIs as identifiers and impose unnecessary cost on URI > consumer applications. The notion of identifiers is essentially at odds > with having multiple for the same entity. > > However, in the case of EPRs the ability to issue multiple different > ones > for the same endpoint is a fundamental requirement. There are > situations > where multiple access channels to the endpoint are provided but they > need > to be selectively exposed to different clients; the hosting > infrastructure > may issue different EPRs to different clients such that a client > application would be able unable to tell whether they correspond to the > same resource or not. To abuse the snail mail metaphor a bit more, the > address that is encoded in a letter sent to me cannot be used to > identify > me in the way my social security number can, but is useful if you want > to > get your letters to me. I have several mail addresses but a single > social > security number. The notion of address assumes that the same entity may > have many, in contrast with the notion of identity (as the Web Arch. > document recognizes). > > A "wire-centric" interpretation of EPRs, I think, is much more > consistent > with the spirit of Web services than this identity based approach. An > EPR > encapsulates the information that must be conveyed in a message > envelope to > ensure that it can be properly delivered to the endpoint. This is > clearly > not the same as providing an identifier for the resource, but is > essential > to ensure interoperable access to endpoints. Regardless of how > inconsistent the language in the current spec is today, we need to > make it > clear that WS-Addressing is not about identifying resources but only > about > providing the means to interoperably direct (address) messages to them. > Moreover, I think we should state that the identification of endpoints > is > not within the scope of this WG. We just need to make these points > clear in > the spec and let the TAG rest at ease knowing we are not trying to > break > their carefully constructed Web architecture. > > Paco > > > > > > > > > "David Orchard" > <dorchard@bea.com> To: > <public-ws-addressing@w3.org> > Sent by: cc: > public-ws-addressing-req Subject: i0001: > EPRs as identifiers > uest@w3.org > > > 11/13/2004 01:13 AM > > > > > > I offer a proposal for what the issue is, and a solution > > Issue: > > WS-Addressing EPRs specify a resource identification mechanism, called > reference properties and reference parameters, in addition to URIs for > identification purposes. There is not a clear justification of the > benefits of such an additional resource identification mechanism. > > The W3C Web architecture [1] states “To benefit from and increase the > value > of the World Wide Web, agents should provide URIs as identifiers for > resources. Other resource identification systems (see the section on > future directions for identifiers) may expand the Web as we know it > today. > However, there are substantial costs to creating a new identification > system that has the same properties as URIs.” > > The W3C TAG was asked the question about when to use GET for retrieving > resource representations and indirectly about when URIs should be > provided > for resources in Issue #7 [2] and produced a finding [3]. Some of the > finding material is included in the Web arch document. The Web > architecture is clear that there are substantial costs associated with > resource identification systems other than URIs and the implication is > that > the benefits to such additional systems should be substantial. > > The URI specification [4] provides a definition of a resource “A > resource > can be anything that has identity.” Thus we do not need to determine > whether an EPR identifies a resource or not, but whether an EPR is > used as > an identifier. > > The WS-Addressing member submission [5] is fairly clear that EPRs are > used > for identification purposes. Some sample quotes used in the document: > “Dynamic generation and customization of service endpoint > descriptions. “ > “Identification and description of specific service instances” > “we define a lightweight and extensible mechanism to dynamically > identify > and describe service endpoints and instances” > “Specific instances of a stateful service need to be identified” > “A reference may contain a number of individual properties that are > required to identify the entity or resource being conveyed” > > A tell-tale sign of identifiers is comparisons of identifiers. The URI > specification provides rules for URI comparison. The WS-Addressing > submission provides rules for EPR comparison. > > Note that the Web architecture does not discuss stateful versus > stateless > services or interactions, nor does it discuss the use of HTTP Cookies > to > contain state or state identifier information. To a certain degree, > the > comparison of URIs vs EPRs may be thought of as a comparison of URIs + > HTTP > cookies vs EPRs. However, this comparison will describe EPRs with > reference properties without discussion of HTTP cookies, but the > similarity > of the benefits of HTTP cookies and EPR reference properties is > captured in > the EPR benefits. > > [1] http://www.w3.org/TR/webarch/#uri-benefits > [2] http://www.w3.org/2001/tag/issues.html#whenToUseGet-7 > [3] http://www.w3.org/2001/tag/doc/whenToUseGet.html > [4] http://www.ietf.org/rfc/rfc2396 > [5] http://www.w3.org/Submission/2004/SUBM-ws-addressing-20040810/ > > > Resolution: > The WS-Addressing WG will provide material, TBD format such as > standalone > or primer or …, that shows the benefits to be gained from WS-Addressing > reference properties and parameters. It includes a comparison with URI > only solutions. > > Sample applications and benefits > A sample application is introduced with a skeletal display of use of > URIs > and EPRs. > > Sample application #1: Stateful Web service client. > > A stateful service acting as a client makes a request to another > service. > The client makes a request containing a ReplyTo containing an EPR. The > invoked service responds with the requested information including an > WS-Addressing EPR processing model > > Variation #1: Reference properties > Client->Service:request > <s:Header> > <wsa:ReplyTo> > <wsa:EndpointReference> > <wsa:Address>http://www.fabrikam123.example/acct</wsa:Address> > <wsa:ReferenceProperties> > <fabrikam:CustomerKey>123456789</fabrikam:CustomerKey> > </wsa:ReferenceProperties> > </wsa:EndpointReference> > </wsa:ReplyTo> > </s:Header> > > Service->Client Callback > > <S:Header> > <wsa:To>http://www.fabrikam123.example/acct</wsa:To> > <fabrikam:CustomerKey>123456789</fabrikam:CustomerKey> > </S:Header> > > Variation #2a: Address only with full featured Qname to URI mapping > > This takes the fabrikam:CustomerKey Qname and content and incorporates > it > into the Address using an extension of QName/URI binding style #10 in > [1] > Client->Service:request > <s:Header> > <wsa:ReplyTo> > <wsa:EndpointReference> > > <wsa:Address>http://www.fabrikam123.example/acct? > (fabrikamns)CustomerKey=123456789</wsa:Address> > </wsa:EndpointReference> > </wsa:ReplyTo> > </s:Header> > > Service->Client Callback > > <S:Header> > > <wsa:To>http://www.fabrikam123.example/acct? > (fabrikamns)CustomerKey=123456789</wsa:To> > </S:Header> > > Variation 2b: Simple Address > The Address may be significantly simpler, such as > > <wsa:Address>http://www.fabrikam123.example/acct/123456789</wsa: > Address> > > > Comparison of Variations. > > This comparison uses the Architecture properties of Key interest > section > from Dr. Fielding’s thesis [2] as the criteria for evaluating these 2 > styles. This is based upon the network characteristics of the > architectures. Note that the thesis specifically excludes those > properties > that are of interest to software architectures. > > Performance > Advantage of URIs > Comparing URIs is simpler than comparing EPRs because the cost of > canonicalizing EPRs can be significant given the XML C14N algorithm. > > > Scalability > No difference. Both styles support stateful and stateless > interactions. > > > Simplicity > > Advantages of EPRs > Web services are based upon XML. Many applications use XML as the > mechanisms for identifying their components. The binding of XML into > URIs > is not standardized and potentially problematic, some of the issues > being: > - XML contains QNames as element names, attribute names, and content. > QNames are based upon absolute URIs. URIs in URIs is not simple. > - XML elements can have multiple children at all levels, whereas URIs > have > path hierarchy that ends in a multiple children query parameters. > - The XML information model is complex with attributes, elements, PIs, > comments, entity references and whitespace. These do not match well to > URIs. > - Character encodings are different between XML and URIs. > - URIs have potential length restrictions > - URIs have different security properties than SOAP header blocks, > such as > level of encryption and signing. > > XML applications that use XML for identification will probably be > simpler > to write with EPRs than with URI only identifiers. This includes SOAP > tools and XML tools. > > Advantages of URIs > In many cases, the complexity of XML is not needed for the identifier, > as > shown in example 2b. This enables the web service to be "on the Web" > as an > HTTP GET can be used to retrieve a representation of the state of the > resource. This also enables much of the web infrastructure to operate, > such as caching intermediaries, security firewalls, etc.. This can > lead to > easier to debug systems (a web browser can retrieve the state for a > human).. > > > Evolvability > Advantage of EPRs > Separating the reference property from the URI may make it easier for > service components to evolve. A service component may know nothing > about > the deployment address of the service from the reference properties. > This > effectively separates the concerns of identifiers into externally > visible > and evolvable from the internally visible and evolvable. For example, > a > dispatcher could evolve the format it uses for reference properties > without > concern of the URI related software. The use of SOAP tools - for > parsing > the soap header for the reference properties - or xml tools - such as > an > xpath expression on the message - allow separate evolvability of > components. > > Advantage of URIs > No advantage to URIs for evolvability. > > Visibility > Advantage of EPRs > URIs provide for visibility into the interaction between two > components. > There are scenarios that indicate visibility into the reference > property is > not necessary. Inserting the reference property may hinder visibility. > The security desired may be at the address level, and inserting the URI > serialization of the ref property may harm the ability to appropriately > apply security. For example, the Address may already have query > parameters > that are part of the service identifier, and the reference property as > a > query parameter may result in difficult parsing as the query > parameters are > not necessarily order preserved. Potentially multiple reference > properties > compounds the problem. > > Additionally, a service provider may not want for the reference > property to > be visible as part of the URI. Presumably they could encrypt the > reference > property and then insert into the Address field, but this leaves us > back to > the simplicity argument and inserting XML into URIs. > > Advantage of URIs > URI only EPRs offer clearly higher visibility into the message for any > intermediary. > > Security > Advantage of EPRs > Dr. Fielding’s thesis does not directly address security. One > potential > aspect of security is “guessing” at endpoints. Encrypting the > reference > property does not cover signing a reference property. A reference > property > might be encrypted and signed by a service provider using the OASIS > WS-Security standard > > > Real-World > Advantage of EPRs > It is useful to examine not only theoretical architectures properties > but > real-world deployed architectures. A significant portion of the Web is > deployed with stateful web components that use HTTP Cookies to contain > session or state identifying information. For a variety of reasons, > typically those detailed previously, application developers have > chosen to > use HTTP Cookies to contain identifying information in addition to > URIs. > > Additionally, a variety of efforts have been undertaken to facilitate > mapping of XML and QNames to URIs, such as WSDL 2.0 HTTP Binding. > There > does not appear to be substantial product group interest in these > technologies. > > Advantage of URIs > The subset of the Web that is "on the Web", that is has a URI that is > dereferenceable, is clearly widely scalable, deployed, etc. > > Conclusion > This has shown that the choice of EPRs with Reference Properties versus > EPRs without reference properties is a complex choice best left to the > application developer. As they have the choice with a Web of HTTP > URIs and > HTTP Cookies today, WS-Addressing gives Web service application > developers > the choice of their identifier architecture. They can use URI only > EPRs > and they can use URIs + XML based reference properties and parameters. > > [1] http://www.pacificspirit.com/blog/2004/04/29/binding_qnames_to_uris > [2] > http://www.ics.uci.edu/~fielding/pubs/dissertation/ > net_app_arch.htm#sec_2_3 > > > > > -- Mark Nottingham Principal Technologist Office of the CTO BEA Systems
Received on Tuesday, 16 November 2004 04:57:01 UTC