Copyright © 2002 W3C® (MIT, INRIA, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply.
This document specifies an XML Signature "decryption transform" that enables XML Signature applications to distinguish between those XML Encryption structures that were encrypted before signing (and must not be decrypted) and those that were encrypted after signing (and must be decrypted) for the signature to validate.
This is an editors' draft with no standing.
This specification from the XML Encryption Working Group (Activity) is a Candidate Recommendation of the W3C. None of the last call issues on the XML Encryption specifications concerned this specification. Furthermore, the WG considers this specification to be stable and invites implementation feedback during this period.
The exit criteria for this phase is at least two interoperable implementations of this transform with acceptable performance. The interoperability of this specification will be demonstrated as an algorithm in the XML Encryption Syntax and Processing Interoperability Report. We expect to meet all requirements of that report within the two month Candidate Recommendation period (closing April 25). Specific areas where we would appreciate further experience are:
Publication of this document does not imply endorsement by the W3C membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite a W3C Working Draft as anything other than a "work in progress." A list of current W3C working drafts can be found at http://www.w3.org/TR/.
Please send comments to the editors (<merlin@baltimore.ie>, <imamu@jp.ibm.com>, and <maruyama@jp.ibm.com>) and cc: the list xml-encryption@w3.org (publicly archived).
Patent disclosures relevant to this specification may be found on the Working Group's patent disclosure page in conformance with W3C policy.
It has been noted by David Solo in [Solo] that both signature [XML-Signature] and encryption [XML-Encryption] operations may be performed on an XML document at any time and in any order, especially in scenarios such as workflow. For example, Alice wishes to order and pay for a book from Bob using the mutually trusted payment system ZipPay. Bob creates an order form including the book title, price and his account info. He wants to sign all of this information, but will subsequently encrypt his account info for ZipPay only. He sends this to Alice who affirms the book title and price, signs the form and presents the twice-signed order with her own payment information to ZipPay. To validate both signatures ZipPay will have to know that the cipher data version of the encrypted information is necessary for validating Alice's signature, but the plain data form is necessary for validating Bob's signature. (See "Sign What You See" (section 5.2) for more on signing encrypted data.)
Since encryption operations applied to part of the signed content after a signature operation cause a signature not to be verifiable, it is necessary to decrypt the portions encrypted after signing before the signature is verified. The "decryption transform" proposed in this document provides a mechanism; decrypting only signed-then-encrypted portions (and ignoring encrypted-then-signed ones). A signer can insert this transform in a transform sequence (e.g., before Canonical XML [XML-C14N] or XPath [XPath]) if there is a possibility that someone will encrypt portions of the signature.
The transform defined in this document is intended to propose a resolution
to the decryption/verification ordering issue within signed resources. It is
out of scope of this document to deal with the cases where the ordering can
be derived from the context. For example, when a ds:DigestValue
element or a (part of) ds:SignedInfo element is encrypted, the
ordering is obvious (without decryption, signature verification is not
possible) and there is no need to introduce a new transform.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 [Keywords].
This document makes use of the XML Encryption [XML-Encryption] and XML Signature [XML-Signature] namespaces, and defines it own, with the following prefixes:
xmlns:xenc="http://www.w3.org/2001/04/xmlenc#" xmlns:ds="http://www.w3.org/2000/09/xmldsig#" xmlns:dcrpt="http://www.w3.org/2001/04/decrypt#"
While implementations MUST support XML and XML namespaces, the use of our
"xenc", "ds", and "dcrpt" XML
namespace prefixes is OPTIONAL; we use this facility to provide compact and
readable exposition.
The contributions of the following Working Group members to this specification are gratefully acknowledged:
This transform supports two modes of operation: In XML mode, the encrypted data must constitute XML data, and the
result of this transform is the node set that results from their decryption. In binary mode, the result is the raw octet stream
that results from the decryption process. In both modes, xenc:EncryptedData elements in the input node-set can be
excluded from processing using dcrpt:Except elements. dcrpt:Except is defined below via XML Schema [XML-Schema] and appears as direct child elements of the ds:Transform element.
The REQUIRED URI attribute value of the
dcrpt:Except element MUST be a non-empty same-document URI
reference [URI] (i.e., a number sign ('#') character
followed by an XPointer expression [XPointer] (as
profiled by The
Reference Processing Model (section 4.3.3.2) of the XML Signature
specification [XML-Signature])) and identify
xenc:EncryptedData elements within the input to the
transform.
  Schema Definition:
  <?xml version="1.0" encoding="utf-8"?>
  <!DOCTYPE schema PUBLIC "-//W3C//DTD XMLSchema 200102//EN"
    "http://www.w3.org/2001/XMLSchema.dtd" [
    <!ATTLIST schema
      xmlns:dt CDATA #FIXED "http://www.w3.org/2001/04/decrypt#">
    <!ENTITY % p ''>
    <!ENTITY % s ''>
  ]>
  <schema xmlns="http://www.w3.org/2001/XMLSchema" version="0.1"
          xmlns:dt="http://www.w3.org/2001/04/decrypt#"
          targetNamespace="http://www.w3.org/2001/04/decrypt#"
          elementFormDefault="qualified">
    <element name="Except" type="dt:ExceptType"/>
    <complexType name="ExceptType">
      <attribute name="Id" type="ID" use="optional"/>
      <attribute name="URI" type="anyURI" use="required"/>
    </complexType>
  </schema>
This mode of the transform requires an XPath node-set [XPath] for
input. If an octet stream is given as input, it MUST be converted to a
node-set as described in The
Reference Processing Model (section 4.3.3.2) of the XML Signature
specification [XML-Signature]. The transform
decrypts all the xenc:EncryptedData elements except for those
specified by dcrpt:Except elements.
The output of the transform is a node-set.
This section describes the processing rules of the transform. The rules are written as two functions; the inputs and outputs of the transform are those of the decryptXML() function, which itself calls the decryptNodeSet() function.
where N is a node-set and E is a set of
      exception URIs held by URI attributes of
      dcrpt:Except elements. O is a node-set, computed
      as follows:
Dereference each exception URI from E in the context of the owner document of N, resulting in a set of XPointer location-sets [XPointer] X.
xenc:EncryptedData element d and
	  its descendants, process the replacement node-set
	  Od, from Y, with the
	  following additional rules:
	  xmlns="" MUST be
                  emitted with every apex element that has no namespace
                  prefix and URI as described in Serializing
                  XML (section 4.3.3) of the XML Encryption
                  specification [XML-Encryption].xenc:EncryptedData element causing a
              parsing error.xenc:EncryptedData elements, then N is
              still canonicalized and parsed.where N is a node-set and E is a set of
      exception URIs held by URI attributes of
      dcrpt:Except elementsX is
      a set of location-sets. Y is a set of
      node-sets and/or octet streams, computed as follows:
xenc:EncryptedData that are
          not identified by any exception URI in Exenc:EncryptedData element, only location-sets in
              X resulting from exception URIs with
              a full XPointer "xpointer(id('ID'))" or bare name [XPointer] are considered.xenc:EncryptedData elements with
	      matching Id attributes.xenc:EncryptedData element d from
          D: 
          Type attribute, resulting in
              a node-set xenc:EncryptedData element with the
                  Type attribute whose value is &xenc;Element
                  or &xenc;Content
                  is specified in A
                  Decrypt Implementation (section 4.3.1) of the XML
                  Encryption specification [XML-Encryption], and the result
                  is a node-set.Type attribute is absent or its value
                  is neither &xenc;Element
                  nor &xenc;Content,
                  the result is an octet stream. Type attribute is absent, is not known
		  to the decryptor, or the result of its processing is not a
		  node-set, then the implementation MUST signal
		  a failure of the transform.xenc:EncryptedData
                  element fails, then the implementation MUSTxenc:EncryptedData element and continue
                  processing.xenc:EncryptedData element.It is out of scope of this document how to create a
ds:Transform element and where to insert it in a transform
sequence. In this section, we just show a way to create the element as an
advisory.
A ds:Transform element can be created by the following
steps:
xenc:EncryptedData, create an dcrpt:Except
    element referencing the node.ds:Transform element, including the algorithm
    identifier of this transform and all the dcrpt:Except
    elements created in Step 3.[Include xml:foo attributes in example]
Suppose that a part of the following XML document ([02-14])
is to be signed. Note that a few parts of the document
([05,11,12]) are already encrypted prior to signature. Also
suppose that the signer anticipates that additional parts of the document
will be encrypted after signing.
[01] <Document> [02] <ToBeSigned Id="tbs"> [03] <Part number="1"> [04] <Data>...</Data> [05] <xenc:EncryptedData Id="#secret-1" .../> [06] </Part> [07] <Part number="2"> [08] <Data>...</Data> [09] </Part> [10] <Secrets> [11] <xenc:EncryptedData .../> [12] <xenc:EncryptedData .../> [13] </Secrets> [14] </ToBeSigned> [15] </Document>
In order to let the recipient know the proper order of decryption and
signature verification, the signer includes the decryption transform
([19-22]) in the signature. The dcrpt:Except
elements ([20,21]) identify parts of the document that are
already encrypted.
  [a01] <Document>
  [a02]   <ToBeSigned Id="tbs">
  [a03]     <Part number="1">
  [a04]       <Data>...</Data>
  [a05]       <xenc:EncryptedData Id="#secret-1" .../>
  [a06]     </Part>
  [a07]     <Part number="2">
  [a08]       <Data>...</Data>
  [a09]     </Part>
  [a10]     <Secrets>
  [a11]       <xenc:EncryptedData .../>
  [a12]       <xenc:EncryptedData .../>
  [a13]     </Secrets>
  [a14]   </ToBeSigned>
  [a15]   <dsig:Signature ...>
  [a16]     ...
  [a17]     <dsig:Reference URI="#tbs">
  [a18]       <dsig:Transforms>
  [a19]         <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#XML">
  [a20]           <dcrpt:Except URI="#secret-1"/>
  [a21]           <dcrpt:Except URI="#xpointer(id('tbs')/Secrets/*)"/>
  [a22]         </dsig:Transform>
  [a23]       </dsig:Transforms>
  [a24]       ...
  [a25]     </dsig:Reference>
  [a26]     ...
  [a27]   </dsig:Signature>
  [a28] </Document>
Consider that this document is subsequently encrypted by various processes, resulting in the following:
  [b01] <Document>
  [b02]   <ToBeSigned Id="tbs">
  [b03]     <xenc:EncryptedData Id="part-1" Type="&enc;Element" .../>
  [b04]     <xenc:EncryptedData Id="part-2" Type="&enc;Element" .../>
  [b05]     <Secrets>
  [b06]       <xenc:EncryptedData .../>
  [b07]       <xenc:EncryptedData .../>
  [b08]     </Secrets>
  [b09]   </ToBeSigned>
  [b10]   <dsig:Signature ...>
  [b11]     ...
  [b12]     <dsig:Reference URI="#tbs">
  [b13]       <dsig:Transforms>
  [b14]         <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#XML">
  [b15]           <dcrpt:Except URI="#secret-1"/>
  [b16]           <dcrpt:Except URI="#xpointer(id('tbs')/Secrets/*)"/>
  [b17]         </dsig:Transform>
  [b18]       </dsig:Transforms>
  [b19]       ...
  [b20]     </dsig:Reference>
  [b21]     ...
  [b22]   </dsig:Signature>
  [b23] </Document>
Execution of the decryption transform will proceed as follows:
ToBeSigned element and its children, less comments
    ([b02-b09]). The parameter to the transform, E, is
    a set containing the two exception URIs ([b15,b16]).Secrets element
    ([b06,b07]); this is the exception location-set in
    X.xenc:EncryptedData elements, dpart-1
    ([d03]) and dpart-2
    ([d04]). Each of these is decrypted, resulting in the
    following node-sets for Opart-1 and
    Opart-2:[c01] <Part number="1"> [c02] <Data>...</Data> [c03] <xenc:EncryptedData Id="#secret-1" .../> [c04] </Part>
[d01] <Part number="2"> [d02] <xenc:EncryptedData Id="#data-2" Type="&enc;Element" .../> [d03] </Part>
Note that part of the second node-set
  ([d02]) has been
  super-encrypted.
xenc:EncryptedData
    elements ([c03] and [d02]) have been revealed.
    However, the first matches an exception URI with a bare name and so is
    not considered further; hence, D for Opart-1
    is empty while D for Opart-2 contains just
    the xenc:EncryptedData element ddata-2
    ([d02]). This is decrypted again, resulting in the following
    node-set Odata-2:[e01] <Data>...</Data>
xenc:EncryptedData element are revealed, so
    D for Odata-2 is empty and processing falls
    through to canonicalization.xenc:EncryptedData elements
    that were decrypted, it canonicalizes the replacement node-sets.
    Similarly, it also replaces any decrypted xenc:EncryptedData
    elements in the replacement node-sets. Further, canonicalization of any
    replacement node-sets is augmented such that xmlns="" is
    emitted on any apex elements that have no namespace prefix and URI. The
    resulting canonicalized data are the following:[f01] <Document> [f02] <ToBeSigned Id="tbs"> [f03] <Part xmlns="" number="1"> [f04] <Data>...</Data> [f05] <xenc:EncryptedData Id="#secret-1" .../> [f06] </Part> [f07] <Part xmlns="" number="2"> [f08] <Data xmlns="">...</Data> [f09] </Part> [f10] <Secrets> [f11] <xenc:EncryptedData .../> [f12] <xenc:EncryptedData .../> [f13] </Secrets> [f14] </ToBeSigned> [f15] </Document>
xenc:EncryptedData
    element being decrypted, then this SHOULD correspond to an encrypted single-rooted node-set.
    However, this need not be the case: after decryption, multiple top-level
    nodes may be well-formed if they consist of whitespace, comments,
    processing instructions and a single element. No special processing is
    required to test for this condition because ill-formed data will result
    in a parsing error.xenc:EncryptedData element uses same-document references, or
    if an exceptional super-encrypted xenc:EncryptedData element
    is referenced by a non-bare name XPointer URIAs an example of where super-encryption over same-document references
    may cause problems, consider the following signed document ([02-05]):
[01] <Document> [02] <ToBeSigned Id="tbs"> [03] <Data>...</Data> [04] ... [05] </ToBeSigned> [06] <dsig:Signature ...> [07] ... [08] <dsig:Reference URI="#tbs"> [09] <dsig:Transforms> [10] <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#XML" /> [11] </dsig:Transforms> [12] ... [13] </dsig:Reference> [14] ... [15] </dsig:Signature> [16] ... [17] </Document>
If the Data element ([03]) is subsequently encrypted, along with
    other data elsewhere in the document, the new xenc:EncryptedData
    element could use a same-document retrieval method to identify shared keying
    information ([a06]):
[a01] <Document> [a02] <ToBeSigned Id="tbs"> [a03] <xenc:EncryptedData ...> [a04] ... [a05] <dsig:KeyInfo ...> [a06] <dsig:RetrievalMethod URI="#key-info" .../> [a07] </dsig:KeyInfo> [a08] ... [a09] </xenc:EncryptedData> [a10] ... [a11] </ToBeSigned> [a12] <dsig:Signature ...> [a13] ... [a14] <dsig:Reference URI="#tbs"> [a15] <dsig:Transforms> [a16] <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#XML" /> [a17] </dsig:Transforms> [a18] ... [a19] </dsig:Reference> [a20] ... [a21] </dsig:Signature> [a22] </Document>
If this new xenc:EncryptedData is subsequently super-encrypted
    ([b02]), the decryption transform will fail: When the inner retrieval
    method is processed, it will be dereferenced within the context of a new decrypted
    document that does not contain the referenced keying informatiohn.
[b01] <Document> [b02] <xenc:EncryptedData Id="tbs" .../> [b03] <dsig:Signature ...> [b04] ... [b05] <dsig:Reference URI="#tbs"> [b06] <dsig:Transforms> [b07] <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#XML" /> [b08] </dsig:Transforms> [b09] ... [b10] </dsig:Reference> [b11] ... [b12] </dsig:Signature> [b13] </Document>
As an example of where non-barename XPointers may fail, consider the
    following signed document ([02-07]) which uses
    a full XPointer ([13]) to identify data that were already
    encrypted when the signature was generated ([05]), and so
    should not be processed by the decryption transform:
[01] <Document> [02] <ToBeSigned Id="tbs"> [03] <Data>...</Data> [04] <Secrets> [05] <xenc:EncryptedData Id="#secret-1" .../> [06] </Secrets> [07] </ToBeSigned> [08] <dsig:Signature ...> [09] ... [10] <dsig:Reference URI="#tbs"> [11] <dsig:Transforms> [12] <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#XML"> [13] <dcrpt:Except URI="#xpointer(/Document/ToBeSigned/Secrets/*)"/> [14] </dsig:Transform> [15] </dsig:Transforms> [16] ... [17] </dsig:Reference> [18] ... [19] </dsig:Signature> [20] </Document>
If the Secrets element is subsequently encrypted, as
    shown in the following example ([a04]); i.e.,
    the existing xenc:EncryptedData is super-encrypted, then
    the XPointer exception URI will no longer resolve. As a result, the
    decryption transform will attempt to perform super-decryption of the
    inner xenc:EncryptedData element and processing will fail.
[a01] <Document> [a02] <ToBeSigned Id="tbs"> [a03] <Data>...</Data> [a04] <xenc:EncryptedData Id="#secrets" .../> [a05] </ToBeSigned> [a06] <dsig:Signature ...> [a07] ... [a08] <dsig:Reference URI="#tbs"> [a09] <dsig:Transforms> [a10] <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#XML"> [a11] <dcrpt:Except URI="#xpointer(/Document/ToBeSigned/Secrets/*)"/> [a12] </dsig:Transform> [a13] </dsig:Transforms> [a14] ... [a15] </dsig:Reference> [a16] ... [a17] </dsig:Signature> [a18] </Document>
The reason that the full XPointer cannot be processed is that the
    document which results from decrypting the outer xenc:EncryptedData
    element will have the following form ([b01-b05]):
[b01] <dummy> [b02] <Secrets> [b03] <xenc:EncryptedData Id="#secret-1" .../> [b04] </Secrets> [b05] </dummy>
For example, consider the following document which contains
    parts that are to be signed for receipt by different individuals.
    The XPath transform ancestor-or-self::*[@For="job"]
    selects only the subset [02-04]. If this XPath transform
    occurs after the decryption transform, and another
    part of the document contains encrypted data (e.g., [07]),
    whether created before or after the signature, then the decryption
    transform may fail to decrypt them and processing will fail. If
    the XPath transform occurs first, then the encrypted data will not
    be considered by the decryption transform.
[01] <Document> [02] <ToBeSigned For="job"> [03] ... [04] </ToBeSigned> [05] <ToBeSigned For="bob"> [06] ... [07] <xenc:EncryptedData .../> [08] </ToBeSigned> [09] </Document>
#xpointer(/ToBeSigned/*[3]) will no longer function
    if the first two children of the ToBeSigned element are
    encrypted together. Care SHOULD be taken when employing such references
    in association with the decryption transform.xenc:EncryptedKey
    elements within its scope of specifically indicating elements, and their
    exceptions, that should be decrypted. An xenc:EncryptedKey
    element that exists as a descendent of xenc:EncryptedData
    element might be decrypted and will be removed from the original document
    as part of processing its ancestor xenc:EncryptedData
    element with the transform. However, a lone
    xenc:EncryptedKey element will be processed like any other
    data: a signature is presumed to be over that actual element and not its
    decrypted form. Consequently, we RECOMMEND that
    xenc:EncryptedKey elements always be children of an
    xenc:EncryptedData element's ds:KeyInfo element
    when they fall within the scope of a signature.This mode of the transform requires an XPath node-set [XPath] for
input. If an octet stream is given as input, it MUST be converted to a
node-set as described in The
Reference Processing Model (section 4.3.3.2) of the XML Signature
specification [XML-Signature]. The transform
decrypts all the xenc:EncryptedData elements except for those
specified by dcrpt:Except elements.
The output of the transform is an octet-stream.
The binary mode of operation is intended for use when generating a signature over binary data that are to be encrypted for transmission to the recipient. Use of this mode of the transform allows a signature to be computed over the plaintext form of the data, rather than the opaque ciphertext. This further allows the ciphertext to be stored elsewhere, identified by a cipher reference, without the need for the signature to take this into account.
This section describes the processing rules of the binary mode of this transform. The inputs and outputs of the transform are those of the decryptBinary() function.
where N is a node-set and E is a set of
      exception URIs held by URI attributes of
      dcrpt:Except elements. O is an octet stream, computed
      as follows:
xenc:EncryptedData that are
          not identified by any exception URI in E.
          xenc:EncryptedData element d from
          D, decrypt d, without regard for which, if any, of its
          descendants are in N, and without consideration of
          its Type attribute, resulting in
          an octet-stream Od.EncryptedData elements in D,
          then the result is a zero-length octet stream.Consider the following example signed document:
<Document>
  <xenc:EncryptedData Id="image" MimeType="image/png" ...>
    ...
    <!-- image data -->
    ...
  </xenc:EncryptedData>
  <dsig:Signature ...>
    ...
    <dsig:Reference URI="#image">
      <dsig:Transforms>
        <dsig:Transform Algorithm="http://www.w3.org/2001/04/decrypt#Binary" />
      </dsig:Transforms>
      ...
    </dsig:Reference>
    ...
  </dsig:Signature>
</Document>
Much of the encrypted data and signature are elided; the implication of the comment in the encrypted data is that the encrypted content is a binary image.
Execution of the decryption transform will proceed as follows:
EncryptedData element and its children, less
    comments. The parameter to the transform, E, is empty.EncryptedData element,
    dimage. This is decrypted, resulting
    in an octet string Oimage containing
    the plaintext of the binary image.When this algorithm is used to facilitate subsequent encryption of data
already signed, the digest value of the signed resource still appears in
clear text in a ds:Reference element. As noted by Hal Finney in
[Finney], such a signature may reveal information (via
the digest value) over encrypted data that increases the encryption's
vulnerability to plain-text-guessing attacks. This consideration is out of
scope of this document and (if relevant) should be addressed by applications.
For example, as proposed by Amir Herzberg in [Herzberg], one may include a random 'salt' in a resource
being signed to increase its entropy.
Another approach is that when a signature referent is encrypted, one may
also encrypt the signature (or at least the ds:DigestValue
elements). As noted by Joseph Reagle in [Reagle], this
latter solution works only if signature and encryption are well known by each
other. For example, the signature may not be known of because it is detached.
Or, it may be already encrypted! Consider, Alice Encrypts element A and the
Signature over the parent of A. Bob Encrypts element B (sibling of A) but not
the Signature since he doesn't know about it. Alice then decrypts A and it's
Signature, which may provide information to a subsequent plain text attack on
the encrypted B.
This specification serves scenarios in which a person might sign encrypted data. Because XML Signature [XML-Signature] has only a simple semantic whereby a key is associated with some data -- and nothing more -- the signing of encrypted data is a legitimate process. For example, someone might run a content-neutral time stamp service that will sign any data sent to it with its time-stamping key under the semantic, "I received this on $date $time." However, applications often explicitly or implicitly associate more substantive semantics (e.g., authorizes, agrees, authors) with a signature. No one should be asked to apply a signature and its semantic to data he or she did not see. Just as the principles of Only What is 'Seen' Should be Signed and 'See' What is Signed are important for understanding the import of an XML Signature, they are doubly important when semantics are associated with that signature: one MUST NOT infer that a signature over encrypted data is also a signature over its plain text form, nor that the meaning of that signature over the encrypted data also applies to the plain text. If one wishes to sign the plain text form of data which is later encrypted, use the transform specified in this document!