ISSUE-32: Should hydra:returns and hydra:statusCodes be removed to avoid tight coupling? (was: More Thoughts on Links and Operation Subclasses) from Markus Lanthaler on 2014-02-04 (public-hydra@w3.org from February 2014)

From: Markus Lanthaler <markus.lanthaler@gmx.net>
Date: Tue, 4 Feb 2014 16:39:46 +0100
To: <public-hydra@w3.org>
Message-ID: <022a01cf21bf$56dfc750$049f55f0$@lanthaler@gmx.net>
OK, this is the second thread. This one is trying to find an answer to the
following question:

  "Should hydra:returns and hydra:statusCodes be removed to avoid tight
coupling?" - ISSUE-32 


On Tuesday, February 04, 2014 2:11 AM, Ryan J. McDonough wrote:
> On Feb 3, 2014, at 3:19 PM, Markus Lanthaler wrote:
> > On Friday, January 31, 2014 4:50 PM, Ryan J. McDonough wrote:
[...]
> Look at some of the comments on the old Facebook API. You have people
> whining because they expected a 200 (Ok) and get an image but instead
> they got a 303 or 302 instead and they're all perplexed. A good number
> of devs will take the documentation as fact rather than look at what's
> coming back in the response. Intermediaries on the other hand will
> never look at your documentation and will always look at the headers
> and message body. Some would argue that the client is also an
> intermediary.

Hmm.. that's a very good point. So in your opinion no additional information
about the returned status codes is necessary? Not even at the API level? For
example, to make it clear that if the quota limit is hit a "402 Payment
Required" is returned instead of a "429 Too Many Requests"?


[...]

> >> The fact that HTML doesn't concern the browser with things like returns
> >> types and potential response codes is one of the things that makes the
> >> web work today. Beating the horse a little more, consider a checkout
> >> process whereby one of my payment options use PayPal and I'm sending
> >> data via POST to PayPal's payment API and awaiting the response from
> >> PayPal. I'm going to send the client from my API in my domain, over to
> >> PayPal, where I don't have control over PayPal's API, more importantly
> >> their namespace or response codes. Using my ApiDocumentation to
> >> describe what I think is PayPal's expected response types is recipe for
> >> failure.
> >
> > Then just leave it out :-) It's not required to specify these things.
> 
> You could, but there will be those who are expecting the response types
> because it's in some Hyrda descriptors but not others.

Fair enough. So, in other words, you are saying that clients would fail
because they can't find that information?


> >> Now, without a doubt, HTML forms don't do much to describe what the
> >> form does. HTML rely's on the fact that a Human can parse the text on
> >> the page in order to determine the controls function. In Hydra, we're
> >> trying to get at the machine parseable analog to descriptive text. I
> >
> > Exactly. The solution I chose was to type operations as I felt that's
the
> > simplest solution that a lot of people will understand instinctively.
How
> > would you describe it instead?
> 
> I agree that people will instinctively get this, but HTTP doesn't work
> this way. HTTP requests and have variable response types and Hydra is
> doing what WADL and Swagger do and suggest that there's a single, fixed
> response types.
> 
> Instead, I would recommend that developers work in an asynchronous,
> even-driven fashion and specify handlers to sense and react to the data
> in the response by looking at the headers to determine if the client
> can in fact respond to it. Without a doubt, the service needs to work
> within the constraints of the client. That is, response should likely
> be in JSON-LD and ideally confirm to some type hierarchy defined in the
> service descriptor.

OK. As you know, there are many vocabularies out there. For ecommerce, e.g.,
there are Schema.org and GoodRelations. Now I think it is necessary to
somehow document what kind of types the client should be prepared to handle.
Similar to how you would need to ensure that your client understands the set
of media types used in a specific API. This doesn't have to be described at
the operation level though. We have "supportedClasses" on "ApiDocumentation"
and could leverage that instead.

What do you think about that? Do you find that equally problematic? If so,
where would you start if you were to program a client? Would you crawl the
API? By trial and error?


> I guess it would help to create an example, huh? :)

I think I understand what you mean but examples are always very helpful.

 
[...]

> What I was trying to get at with the profile link header is that if the
> data model is expressed up front and there's sufficient documentation
> about what the types mean, a client can create a number of handlers
> that could react to different responses. If the model is good enough,
> the client could react better to responses that they didn't expect at
> build time.

That sounds like ApiDocumentation/supportedClasses comes close to what you
had in mind. Doesn't it?


> Some developers will read documentation and create code generators that
> the illiterate ones will use. It is here that I feel Hydra will get
> into trouble.

Yeah.. as you know, that's one of my main concerns as well. In the end,
however, I think the only way to avoid that is to implement a generic,
dynamic client that's better than statically generated clients. I know, a
lofty goal :-)


> > I have troubles extracting something actionable from your mails.
> > Would removing returns/statusCodes address your concerns?
> 
> Absolutely!

OK, that's a start. What about statusCodes at the ApiDocumentation level?
Would you remove them as well?

I don't know how much you are into Semantic Web stuff in general, but how do
you feel about rdfs:range in this context? In a sense it is very close to
hydra:returns

  :discussesWith rdf:type hydra:Link ;
                 rdfs:range schema:Person .

  </people/markus> rdf:type schema:Person ;
                   :discussesWith </people/ryan> .



> > If so, what does it really change?
> 
> It'll force developers to look at at what's coming back in the response
> headers rather than what's defined in the Hyrda description. One is a
> hint and the other is fact (i.e. what the server is sending back). By
> removing returns and status, you are now forcing developers to look in
> the right place: the HTTP response headers.
> 
> I have development teams messing things up on a fairly frequent basic
> with WADL and Swagger (seeing a pattern here? :) ) due to the fact that
> they are expecting the server to return exactly what is specified in
> the descriptor and not taking into account they may have to deal with
> both a forward and reverse proxy in the mix.

It's not that trivial to handle all possible responses properly. In a lot
developers simply need/want to get their job done and choose the simplest
route. Surprisingly that works quite well in most cases (I would say more
than the famous 80%). But yeah, I see what you are getting at. I also have
to admit that apart from the natural-language documentation generation use
case I don't see that much value in this information either given that we
have supportedClasses on ApiDocumentation (which again, is just a hint)..
there's also statusCodes there which I still find has some value but I would
need to think more about that.

 
> It could also be the wording. If this is just a hint, then perhaps
> instead of hydra:returns, which sounds a bit more committed than say
> perhaps something more like hyrdra:intimation or perhaps even
> hydra:anticipatedResponseType?

Well, naming is one of those two difficult things in computer science :-) I
don't think it would change much if we would change its name. Perhaps the
stronger signal would be sent by moving this to a separate vocabulary which
adds a couple of other things to facilitate the generation of
natural-language documentations.


> I'm still not sold on status codes. For the most part, everyone is
> going to expect something in the 200 range. There's too many
> exceptional codes to deal with in a format like Hydra to be practical.

Right, there are many. Maybe its again about finding a compromise. I don't
think we would lose much by expressing these things just at the API level
instead of doing so at the operation level. Of course also this could be
moved out of the core vocabulary into a "documentation" vocabulary.


> > IMO it will be just a matter of time till someone else mints
> > a URL for these things in order to, again, be able to transform a
> > Hydra description into a nicely formatted HTML documentation.
> 
> Sure, and that's fine. AAA principle working at it's finest! At least
> no one can point to Hydra Core and blame it for suggesting fixed
> response types :)

:-)


> But seriously, I have to find some time to demo these ideas to better
> illustrate what I'm talking about. That might go a ways in clarifying
> these points.

I feel we already made quite some progress on these recent discussions and
have some concrete options to evaluate:

  - remove "returns" from "Operation"?
  - remove "statusCodes" from "Operation"?
  - remove "statusCodes" from "ApiDocumentation"?
  - move all of them to a separate "documentation generation" vocabulary?

We should also discuss whether "supportedClasses" on "ApiDocumentation" is
enough or perhaps too much :-)


--
Markus Lanthaler
@markuslanthaler
Received on Tuesday, 4 February 2014 15:40:22 UTC