- From: <noah_mendelsohn@us.ibm.com>
- Date: Thu, 13 Nov 2003 17:27:06 -0500
- To: Marc Hadley <Marc.Hadley@Sun.COM>
- Cc: "Xml-Dist-App@W3. Org" <xml-dist-app@w3.org>
I continue to be bothered by two of the concerns that I raised on the telcon. * Concern #1: The SOAP Recommendation says [1]: "As described in 5. SOAP Message Construct, each SOAP message is specified as an XML infoset that consists of a document information item with exactly one child: the SOAP Envelope element information item. Therefore, the minimum responsibility of a binding in transmitting a message is to specify the means by which the SOAP message infoset is transferred to and reconstituted by the binding at the receiving SOAP node and to specify the manner in which the transmission of the envelope is effected using the facilities of the underlying protocol." If I understand the proposal correctly, the bindings supporting the proposal below would tend to violate this rule when confronted with content along the lines of: <env:Envelope xmlns:env="..." xmlns:mtom="..."> <env:Body> <app:Stuff xmlns:app="..."> <app:Thing1 mtom:ContentID="someURI"> some base64 text </app:Thing1> <app:Thing2 mtom:ContentID="someURI"> some OTHER base64 text </app:Thing2> </app:Stuff> </env:Body> </env:Envelope> Granted, content of this sort specifically violates the intended use of the attribute. Maybe we were wise to not allow for bindings to break the usual rules in circumstances like this, or maybe we were shortsighted. The fact is that I don't think the current recommendation licenses a binding to change the content of an Infoset in transmission. Now, in the interest of full disclosure, I can see a bit of a counterargument, though I find it somewhat unpleasant: you might claim that the binding implements some sort of feature that, like encryption by an active intermediary, changes the Infoset. Still, I think we've more clearly licensed active intermediaries than active links, and my current reading is that the SOAP recommendation currently requires that content like that shown above be transmitted with full fidelity, or else that a binding reflect some sort of binding-specific error (and even then, I would argue that it's a pretty poorly spec'd binding that throws errors when confronted with certain perfectly good SOAP infosets. * Concern #2: keeping IDs distinct at intermediaries As I also mentioned on the phone, an intermediary wishing to optimize content would have to verify that any ID used for new content was distinct from those already in use. This seems to put a burden on intermediaries to more carefully parse headers not targeted to them then might otherwise be necessary. In the body, there could also be issues with streaming, as one does not know which IDs have been used until the end of the envelope is seen. I heard Marc suggest the use of GUIDs, and in certain environments GUIDs are practical. Still, given the need for access to the moral equivalent of an Ethernet MAC address as a seed for the GUID, I think that use of GUIDs is at best a compromise. Overall: it seems wrong to me to have bindings that depend for correctness on the integrity of content in the Infoset. MTOM optimizes based on content of infoset, but so far it never fails to send the right thing. At worst you'll make bad decisions about what to optimize. That seems to me to be one of the key attractions of MTOM, and this proposal seems to come close to changing that. In the end, I remain ambivilent about this proposal. The disadvantages include those listed above. If we could convince ourselves that 1-to-1 is OK at the binding level, then all these issues go away, along with reference counting or any other messy alternative. I suppose it comes down to our use cases: is it really important for us to include the same large image or the like in two places in the same SOAP envelope infoset? It's not clear to me that's necessary. I lean somewhat toward what I take to be Gudge's position: if you need to share a template or some such, then either live with the overhead of duplicating it, or more likely, use some sort of explicit ID/REF mechanism within the envelope to share a copy. Noah [1] http://www.w3.org/TR/soap12-part1#bindfw -------------------------------------- Noah Mendelsohn IBM Corporation One Rogers Street Cambridge, MA 02142 1-617-693-4036 -------------------------------------- Marc Hadley <Marc.Hadley@Sun.COM> Sent by: xml-dist-app-request@w3.org 11/12/2003 03:47 PM To: "Xml-Dist-App@W3. Org" <xml-dist-app@w3.org> cc: (bcc: Noah Mendelsohn/Cambridge/IBM) Subject: Proposal for multi-reference support in MTOM Here's a proposal for an extension to the current MTOM formulation to offer better support for multiple inclusion of the same data. The proposed extension has the following properties: - Preserves MTOM semantics of attachment inclusion in SOAP message infoset - Supports existing 'Include' and 'Representation' semantics, use of extension is optional - Supports multiple inclusion of attachments without replication of data in the serialized form - Multiply included data replicated in message infoset, signatures over elements containing such data include attachment data rather than a reference to the data as woud be the case when using a Representation approach. Infoset Form ============ This section shows via an example the infoset of a message after the binding has performed the MTOM deserialization (described later). XML 1.0 is used as the most convenient syntax to express the infoset but this should be considered a purely abtract model of the message content. <env:Envelope xmlns:env="..." xmlns:mtom="..."> <env:Body> <app:Stuff xmlns:app="..."> <app:Thing1 mtom:ContentID="someURI"> some base64 text </app:Thing1> <app:Thing2 mtom:ContentID="someURI"> some base64 text </app:Thing2> <app:Thing3> some base64 text </app:Thing3> </app:Stuff> </env:Body> </env:Envelope> Note that the same base64 data is included as the content of the Thing1 and Thing2 EIIs, this is indicated by the value of the mtom:ContentID attribute being the same for both. Thing3 has no mtom:ContentID indicating that the optional multi-reference extension is not being used for the content of this EII. Optimized (MIME) Wire Form ========================== This section shows via an example the serialized form of a message using the MIME based MTOM. Content-type: multipart/related; boundary="someBoundaryString" --someBoundaryString Content-Type: application/soap+xml <env:Envelope xmlns:env="..." xmlns:mtom="..."> <env:Body> <app:Stuff xmlns:app="..."> <app:Thing1 mtom:ContentID="someURI"> <mtom:Include href="someURI"> <!-- depending on how mtom:ContentID is defined, the Include/@href may be redundant --> </app:Thing1> <app:Thing2 mtom:ContentID="someURI"> <mtom:Include href="someURI"> </app:Thing2> <app:Thing3> <mtom:Include href="someOtherURI"> </app:Thing3> </app:Stuff> </env:Body> </env:Envelope> --someBoundaryString Content-Type: image/png Content-ID: someURI binary picture data --someBoundaryString Content-Type: image/png Content-ID: someOtherURI binary picture data --someBoundaryString-- Schema Types ============ <complexType name="OptimizationCandidate"> <simpleContent> <extension base="xsd:base64Binary"> <attribute name="ContentID" type="xsd:anyURI"/> <attribute name="MediaType" type="xsd:string"/> <!-- other attributes we define --> </extension> </simpleContent> </complexType> Terminology =========== The following terminology is used in the description of the serialization and deserialization algorithms: Optimization candidate: EII of type xsd:base64 or mtom:OptimizationCandidate. Matching MIME part: MIME part whose content-id and/or content-location headers (TBD specify exact matching criteria) match an OptimizationCandidate/@ContentID. Content: base64Binary child CIIs of an optimization candidate (excludes AII children) Infoset to Wire Serialization ============================= For each optimization candidate in the SOAP message - if no matching MIME part exists then create a matching MIME part from the optimization candidate's decoded content and AIIs - replace the content of the optimization candidate with a child mtom:Include EII Wire to Infoset Deserialization =============================== For each mtom:Include EII - replace the mtom:Include EII with base64 encoded attachment content -- Marc Hadley <marc.hadley@sun.com> Web Technologies and Standards, Sun Microsystems.
Received on Thursday, 13 November 2003 17:29:26 UTC