- From: Julian Reschke <julian.reschke@gmx.de>
- Date: Mon, 11 Jul 2016 08:02:55 +0200
- To: Poul-Henning Kamp <phk@phk.freebsd.dk>
- Cc: Yanick Rochon <yanick.rochon@gmail.com>, Phil Hunt <phil.hunt@oracle.com>, HTTP Working Group <ietf-http-wg@w3.org>
On 2016-07-11 00:20, Poul-Henning Kamp wrote: > -------- > In message <176d58df-debf-e660-edf7-7d686c926ef6@gmx.de>, Julian Reschke writes > : > >> It seems you are confusing several issues here: multiple header field >> instances in the HTTP message, and duplicate member names in a JSON >> object. These are completely orthogonal issues. > > Uhm, no? > > They only become orthogonal if whoever specifies the JSON header > takes great care to make that happen. That's what my draft tries to enforce by requiring the data model to be a JSON *array*. > Allowing split headers forces all headers to be defined as JSON > lists, which means we're knee-capping header-designers from the get go. I would call it: "steer them to a format that is robust with regards how header fields can be repeated in messages" > Take this RFC7231 example: > > Accept: text/plain; q=0.5, text/html, text/x-dvi; q=0.8, text/x-c > > If we allow split headers for JSON, this can only be defined as JSON list, > in order that it can also be sent as: > > Accept: <JSON for "text/plain; q=0.5"> > Accept: <JSON for "text/html"> > Accept: <JSON for "text/x-dvi; q=0.8"> > Accept: <JSON for "text/x-c"> > > But that leaves it to the application writer to spot and detect and > handle this degenerate case from a lazy sender: > > Accept: <JSON for "text/plain; q=0.5"> > Accept: <JSON for "text/html"> > Accept: <JSON for "text/x-dvi; q=0.8"> > Accept: <JSON for "text/x-c"> > Accept: <JSON for "text/plain; q=0.0"> The "degenerate" case can happen in a single header field as well. (yes, discussing *simplifiyng* certain header fields is interesting as well, but it again is orthogonal to this discussion) > It's even more complicated for a proxy sender which wants to > modify the text/plain priority: It needs to spot both copies > and change them both (or delete one of them). Yes. > (It seems to me that there is a class of smuggling attacks > where the proxy sees q=0.5 and the server q=0.0, which > RFC7231 does not seem address at all?) Only if it's broken by not processing all field instances, no? > If we instead, as I propose, require that JSON headers *never* be > split, then it becomes both possible and rather obviously smarter > to define this header as a JSON object, keyed by the media type: > > Accept: { \ > "text/plain": <JSON for "q=0.5">, \ > "text/html": <JSON for no parameter>, \ > "text-xdvi": <JSON for "q=0.8">, \ > "text/x-c": <JSON for no parameter> \ > } > > A sender wishing to modify the priority, just sets the > corresponding JSON object using the native languages > JSON facility: > > req.accept["text/plain"] = <JSON for "q=0"> > > The receiver can then simply load the JSON as JSON, and the > application does not have to explicitly check for duplicate > media types, but can simply look up "text/plain" in the > JSON object. Yes, that's a different data model for this header field, that makes certain operations simpler. > So what happens if the sender sends bad JSON anyway ? > > Accept: { \ > "text/plain": <JSON for "q=0.5">, \ > "text/html": <JSON for no parameter>, \ > "text-xdvi": <JSON for "q=0.8">, \ > "text/x-c": <JSON for no parameter> \ > "text/plain": <JSON for "q=0.0">, \ > } > > Well, RFC7159 says: > > 4. Objects > > [...]The names within an object SHOULD be unique. > > An object whose names are all unique is interoperable in > the sense that all software implementations receiving that > object will agree on the name-value mappings. When the > names within an object are not unique, the behavior of > software that receives such an object is unpredictable. > Many implementations report the last name/value pair only. > Other implementations report an error or fail to parse the > object, and some implementations report all of the name/value > pairs, including duplicates. > > In other words: Don't do that. > > What really happens is this: We just opened another door to the > exact same smuggling attack as I mentioned above. > > But this time we can shut them all with one single line of text: > > "Duplicate keys in JSON objects SHALL cause and be treated > as connection failure." But then we can't rely on generic JSON parsers anymore. > So all in all, split JSON headers would be a really bad idea. After replying to your mail, I'm even more convinced that it's the right thing to do. Best regards, Julian
Received on Monday, 11 July 2016 06:03:34 UTC