Re: Certificate Triplify Challenge from Kingsley Idehen on 2012-01-11 (public-xg-webid@w3.org from January 2012)

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Wed, 11 Jan 2012 11:21:29 -0500
To: public-xg-webid@w3.org
Message-ID: <4F0DB709.6020601@openlinksw.com>
On 1/11/12 11:02 AM, Henry Story wrote:
> On 4 Jan 2012, at 19:25, Peter Williams wrote:
>
>> Changing the encoding of the cert format on the wire makes no difference.
> Makes no difference to what?
>
> It could make a difference to many issues we have discussed here, because it would help us to bring in some powerful logical tools from the semantic web space to the discussion.
>
> For example it can help us resolve what is behind Kingsley's whole issue with ambiguity. The ambiguity he is seeing is very likely not in the URIs, where he is looking for it in vain, but more likely in an interpretation of X509.
>
> I put up my result from the Triplify challenge on the wiki here:
>
>    http://www.w3.org/2005/Incubator/webid/wiki/X509Semantics
>
> In particular Kingsley has an issue with the SAN because it is a name. This is in fact no problem for http URIs, either with # or without (with 303 redirects). But it is a problem for e-mail identifiers, or rather let us say it depends on how you "GRDDL" your X509.
>
> Let us say you do this:
>
> <http://example.com/cert>  a cert:Certificate ;
>         log:semantics {
>            <http://example.com/cert>  foaf:primaryTopic _:agent .
>            _:agent cert:distinguishedName [
>                 a cert:DistinguishedName ;
>                 x520:countryName "GB" ;
>                 x520:localityName "London" ;
>                 x520:organizationName "British Broadcasting Corporation" ;
>                 x520:organizationalUnitName "Research and Development" ;
>                 x520:commonName "Test Certificate" ;
>            ] ;
>            owl:sameAs <http://example.com/me#person> ;
>            cert:key [ a cert:RSAPublicKey ;
>                 …
>            ] .
>     } .
>
> Now imagine we could turn a DN into a URL, so let's imagine we have some URL scheme for DNs that works. Then we could write
> equivalently
>
> <http://example.com/cert>  a cert:Certificate ;
>         log:semantics {
>         <http://example.com/cert>  foaf:primaryTopic <DN:/CN=TestCertificate/OU=Research+and+Development/...>  .
>       ...
>      <DN:/CN=TestCertificate/OU=Research+and+Development/...>  owl:sameAs <http://example.com/me#person>  .
> }
>
> Ok, so that does indeed seem to capture the notion of an AlternativeName and the notion that the DN is the Subject Identifier.
>
> The problem is with e-mails. Because there it does not work quite so nicely. You could not write simply
>
> <http://example.com/cert>  a cert:Certificate ;
>         log:semantics {
>         <http://example.com/cert>  foaf:primaryTopic <DN:/CN=TestCertificate/OU=Research+and+Development/...>  .
>       ...
>      <DN:/CN=TestCertificate/OU=Research+and+Development/...>
>                owl:sameAs <http://example.com/me#person>,
>                           <mailto:me@example.com>  .
> }
>
>
> Why? Because mailto URLs refer to mailboxes, not to people. So if you put the mailto URL in that position in that way, you have a problem, unless you create a special mapping from mailto URLs to, say, { _:p foaf:mbox <mailto:me@example.com> . } and then use the blank node _:p as the identifier.
>
> If you do that then you have another problem though, because if it were <mailto:me@example.com#me> then what would you do?
> So Kingsley's problem is that from the syntax of X509 it is not clear - as far as our current research goes - what the solution here should be.
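
The mapping discussed above can be sketched in Python. This is only an illustration under stated assumptions: the function name san_to_triples and the blank node label _:p are hypothetical, and treating an http(s) SAN as owl:sameAs follows the interpretation debated in this thread rather than anything X.509 itself mandates.

```python
from urllib.parse import urlsplit


def san_to_triples(subject, san_uri, bnode="_:p"):
    """Return (subject, predicate, object) tuples relating a cert subject to a SAN URI.

    Assumption: http/https URIs name the agent directly, while a mailto URI
    names a mailbox, so the agent is reached via foaf:mbox on a blank node.
    """
    scheme = urlsplit(san_uri).scheme.lower()
    if scheme in ("http", "https"):
        return [(subject, "owl:sameAs", f"<{san_uri}>")]
    if scheme == "mailto":
        return [
            (bnode, "foaf:mbox", f"<{san_uri}>"),   # the mailbox belongs to _:p
            (subject, "owl:sameAs", bnode),          # _:p stands in for the agent
        ]
    # Other schemes are deliberately left undefined in this sketch.
    raise ValueError(f"no mapping defined for scheme {scheme!r}")
```

Calling san_to_triples("_:agent", "mailto:me@example.com") yields two triples instead of one, which is exactly the extra indirection the mailbox case needs.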
>
> It could just be that here X509 is ambiguous. The people who developed this just were not thinking that carefully about what they meant by SAN. Or one could say the mapping of the relation between the DN and the SAN to owl:sameAs is perhaps too strict. Well there is room for interpretation here.
>
> So the fact that something is ambiguous does not mean you cannot later clarify it. By pushing people to use https URLs with a hash we are, I think, squarely within the best interpretation of what a SAN is. When we move to e-mail addresses things get a bit more awkward. But for the moment this is not such a big deal for us, because we are not trying to deal with e-mail addresses.
>
> But it is a problem for Kingsley because he wants to make the California crowd that is adamant about e-mail addresses happy.

Wow!

I want to make the Internet crowd happy. I want to exercise the 
ingenuity inherent in URI abstraction.

What's wrong with the following in SAN?

URI=http://id.myopenlink.net/dataspace/person/KingsleyUyiIdehen
RFC822 Name=kidehen@openlinksw.com
> They could use the accnt scheme <mailto:me@example.com> and that would probably get a bit closer to us, depending on how you think of an account.

That's a mailto: scheme URI. They could also use acct: of course. Most 
important of all it could be any URI. The issue is the resolution 
mechanism that enables all the action to occur from the SAN.

My example above separates the Name Identifier from the Resource Locator 
(Address) Identifier.
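
A minimal Python sketch of that separation, assuming the SAN entries arrive as 'Type=value' text lines like the two shown above. The function name split_san_entries and the three-way return value are illustrative choices, not part of any certificate library.

```python
def split_san_entries(lines):
    """Split SAN entries into name identifiers (URIs) and address identifiers (mailboxes).

    Assumption: each entry is a 'Type=value' line; anything else is skipped.
    """
    names, addresses, other = [], [], []
    for line in lines:
        kind, sep, value = line.partition("=")  # split on the first '=' only
        if not sep:
            continue  # not a 'Type=value' line
        kind = kind.strip().lower()
        value = value.strip()
        if kind == "uri":
            names.append(value)
        elif kind == "rfc822 name":
            addresses.append(value)
        else:
            other.append((kind, value))
    return names, addresses, other


san = [
    "URI=http://id.myopenlink.net/dataspace/person/KingsleyUyiIdehen",
    "RFC822 Name=kidehen@openlinksw.com",
]
names, addresses, other = split_san_entries(san)
```

Here names holds the URI and addresses holds the mailbox, so a verifier can dereference the former and treat the latter purely as an address.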

Hammer Stack covers the matter at hand. You can ignore it, but that's 
just being unrealistic about reality when the Internet is the domain of 
focus.
>
> On the other hand one could just say, well X509 was never meant to be that coherent, so we can just be flexible here. And then things should fall in line again.

That's the point! Be flexible and the URI abstraction will do its thing. 
The scale is Internet scale not WWW scale. HTTP is about the WWW, which 
is an active part of the Internet. No matter how you cut it, no matter 
how useful it is etc., it is still part of the Internet.

URIs solve an Internet scale problem, not a WWW scale problem. Linked 
Data doesn't have to be WWW scoped, it can work at Internet scale too.

The end destination is inevitable. WebID or NetID, note, I've seen this 
movie before.

I say "Check!" so your move next :-)


Kingsley
>
> Henry
>
>> You can spit it out in long RDF strings if you want. ASN.1 doesn't care whether you use DER, BER, PER, or XML. ISO defined the mappings onto XML, and compilers now spit out bytes - in binary XML or long XML. One can define a spitter for RDF in any of its encodings, if one wants. Any undergrad can do this (it's just 1980s abstract/concrete type theory).
>>
>> But, Henry is right, that this makes NO difference. It's still an ASN.1 cert, with a particular set of type theory formalisms, that REALLY DO NOT WORK well with RDF/EAV (which is very pure). The semantics are abstract, and are not tied to the encoding.
> yes, that is an interesting feature of ASN.1 . Of course the signature does tie the whole thing down to a particular format.
>
>>
>> The semantics for the cert AS A TYPE (not a blob) are also very much tied to the art of public key distribution on an internet scale - whose PRACTICAL security requires a particular way of relying on naming and addressing, and binding, and asserting, and validating, and (all the other things folks discuss here). The cert is just the linchpin of that doctrine set (which is why folks discuss it endlessly, often in rant form). It's why it gets the "evil" label (because it's SO good at actually doing what folks would LIKE to do, when replacing it). Yes, it's getting old, as it's tied to internet 1 (which is getting on in age).
>>
>> In my R&D work, I put triples and SPARQL expressions in the cert (in the form of text encoded into URIs), and avoid the whole SAN URI semantic wars. I thus describe identity the webby way from the outset, using the cert now NOT as above (but as a means to an end, so https libs work). It's just a signed text stream, then, retaining some legacy key management controls so actual SSL is not compromised too badly.
>>
>>
>> Back in 2007, the topics that looked interesting included:
>>
>> http://yorkporc.wordpress.com/2007/09/30/copy-of-httpdarq-sourceforge-net-federated-queries-with-sparql/
>>
>> http://yorkporc.wordpress.com/2007/09/23/email-post-on-using-deriving-sparql-queries-from-foaf-knows-relations-to-assure-pubkeys/
>>
>> But, it takes SO LONG to do anything, in semweb land, that I hardly remember even knowing what I knew then. Kingsley is reminding me, though.
> yes, the idea that since the internet everything moves 7 times faster is a myth.
> I myself thought that people would do things if one told them. But it turns out you have to do it yourself.
>>> From: henry.story@bblfish.net
>>> Date: Wed, 4 Jan 2012 19:04:48 +0100
>>> CC: j.jakobitsch@semantic-web.at; public-xg-webid@w3.org
>>> To: mo.mcroberts@bbc.co.uk
>>> Subject: Re: Certificate Triplify Challenge
>>>
>>>
>>> On 4 Jan 2012, at 16:05, Mo McRoberts wrote:
>>>
>>>> On 4 Jan 2012, at 13:50, Henry Story wrote:
>>>>
>>>>> As soon as you put things this way you realise that it is wrong in fact. Because the above fails to make the point that it is the Certificate that is making the agent claims. What is really needed there is to use N3 to express what is going on:
>>>> Hmm, are you sure? Is it not that the certificate *carries* the claims made by the issuer?
>>> A certificate is a document that is signed by an issuer. It is exactly the type of thing that has a semantics. In fact one could even say that a document is defined by its having a semantics. ( Btw. log:semantics is explained in more detail here http://www.w3.org/2000/10/swap/doc/Reach )
>>>
>>> So let's say I speak of a certificate <http://example.com/cert> here in this e-mail, then I can say what type of thing it is, when it was made etc... I can make statements about that document.
>>> But I can't speak about the contents of that certificate without asserting those contents myself here. Anything or anyone reading what I am writing here would then not know how to distinguish between what I am saying and what the certificate is saying, unless one uses graphs or, if you wanted to make your life really complicated, reification. I.e. one needs a quotation mechanism. In N3 you do this with {...}.
>>>
>>> There is one particularly interesting exception I think, and that is if the document <http://example.com/cert> were to return also an RDF representation, and this would then be written something like
>>>
>>> <>  a cert:Certificate ;
>>>     foaf:primaryTopic _:agent ;
>>>     cert:issuer <http://example.com/ca#cert> ;
>>>     cert:serialNumber 1 ;
>>>     cert:notBefore "2012-01-01T14:00:00Z"^^xsd:dateTime ;
>>>     cert:notAfter "2012-12-31T13:59:59Z"^^xsd:dateTime ;
>>>     cert:extension [
>>>         a cert:basicConstraints ;
>>>         cert:extensionValue [
>>>             cert:ca "false"^^xsd:boolean ;
>>>             cert:pathLengthConstraint 0 ;
>>>         ] ;
>>>     ] ;
>>>     cert:signatureAlgorithm cert:sha1WithRSAEncryption ;
>>>     cert:signature "00010203040506070809...."^^xsd:hexBinary .
>>>
>>> _:agent cert:distinguishedName [
>>>         a cert:DistinguishedName ;
>>>         x520:countryName "GB" ;
>>>         x520:localityName "London" ;
>>>         x520:organizationName "British Broadcasting Corporation" ;
>>>         x520:organizationalUnitName "Research and Development" ;
>>>         x520:commonName "Test Certificate" ;
>>>     ] ;
>>>     cert:key [ a cert:RSAPublicKey ;
>>>         …
>>>     ] ;
>>>     owl:sameAs <http://example.com/me#person> .
>>>
>>>
>>> ( Well the only problem is that the signature would have to be outside the document, in another document presumably, because signing a document with an internal signature is a complicated trick. One would need a signature algorithm that removes certain triples - the signature triples - before signing. And this may have many issues I don't know about )
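
The "remove the signature triples before signing" idea can be sketched in Python as follows. It is a rough illustration and not a real RDF signature scheme: it assumes triples are plain (subject, predicate, object) strings with no blank nodes, it treats sorted lines as the canonical form, and it only computes a SHA-256 digest (a choice made for this sketch) rather than an actual RSA signature. The name digest_without_signature is hypothetical.

```python
import hashlib

SIGNATURE_PREDICATE = "cert:signature"  # triples with this predicate are excluded


def digest_without_signature(triples):
    """Hash a collection of triples after dropping the signature triples.

    Assumptions: no blank nodes, and sorting the serialized lines is an
    adequate canonical form. Real graph canonicalization is much harder.
    """
    kept = [t for t in triples if t[1] != SIGNATURE_PREDICATE]
    lines = sorted(" ".join(t) + " ." for t in kept)
    return hashlib.sha256("\n".join(lines).encode("utf-8")).hexdigest()
```

Because the signature triple is dropped first, the digest is the same before and after the signature is added to the document, which is the property an internal signature needs.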
>>>
>>> Here the document is speaking about itself and its contained statements, so it is clear what the signature is about, and also what it is that the certificate in ASN.1 is saying. We have essentially a sketch of an RDF view on the X509 document here.
>>>
>>>> If the purpose of the ontology is to allow round-tripping (which it must, IMO, so that you can verify the signature on the content — otherwise you might as well just have a lump of arbitrary signed RDF and forget about bothering with X.509's structure), then you have to be careful about how far you diverge from it, and that includes additional statements (which from a processor's perspective are just unsigned additional junk, like a comment header field in a PEM-formatted blob).
>>>>
>>>>> then one realises that the MUST understand statements are statements about grammar changes: they are saying that you cannot believe anything else about what you see in the document unless you understand one statement: i.e., that statement could possibly change the meaning of the other statements seen up to then.
>>>> Yes… the criticality aspect of extensions falls into this category, although in X.509-land the rules assume that you do know how to process “an extension” in general and where to find the criticality field at a minimum — so with an RDF equivalent you could work on the same basis (i.e., you recognise cert:critical, and if it's set and you don't understand one of the classes associated with the extension, fail).
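
The criticality rule described above can be sketched in Python. The representation is an assumption made for illustration: each extension is a dict with a 'type' (the class name) and an optional 'critical' flag, and KNOWN_EXTENSIONS, UnknownCriticalExtension and check_extensions are hypothetical names.

```python
KNOWN_EXTENSIONS = {"cert:basicConstraints"}  # hypothetical set of understood classes


class UnknownCriticalExtension(Exception):
    """Raised when a critical extension is not understood by the processor."""


def check_extensions(extensions, known=KNOWN_EXTENSIONS):
    """Return the understood extensions; fail on an unknown one marked critical."""
    accepted = []
    for ext in extensions:
        if ext["type"] in known:
            accepted.append(ext)
        elif ext.get("critical", False):
            # Unknown and critical: the whole certificate must be rejected.
            raise UnknownCriticalExtension(ext["type"])
        # Unknown but non-critical extensions are simply ignored.
    return accepted
```

The processor only needs to recognise the general shape of an extension and its criticality flag; everything else can stay opaque.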
>>>>
>>>> M.
>>>>
>>>> -- 
>>>> Mo McRoberts - Technical Lead - The Space,
>>>> 0141 422 6036 (Internal: 01-26036) - PGP key CEBCF03E,
>>>> Project Office: Room 7083, BBC Television Centre, London W12 7RJ
>>>>
>>>>
>>>>
>>>> http://www.bbc.co.uk/
>>>> This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated.
>>>> If you have received it in error, please delete it from your system.
>>>> Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately.
>>>> Please note that the BBC monitors e-mails sent or received.
>>>> Further communication will signify your consent to this.
>>>>
>>> Social Web Architect
>>> http://bblfish.net/
>>>
>>>
> Social Web Architect
> http://bblfish.net/
>
>
>

-- 

Regards,

Kingsley Idehen
Founder & CEO
OpenLink Software
Company Web: http://www.openlinksw.com
Personal Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca handle: @kidehen
Google+ Profile: https://plus.google.com/112399767740508618350/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen
Received on Wednesday, 11 January 2012 16:21:53 UTC