Re: RFC: Re: shader IR vs. application IR from Kai Ninomiya on 2017-08-17 (public-gpu@w3.org from August 2017)

From: Kai Ninomiya <kainino@google.com>
Date: Thu, 17 Aug 2017 02:26:21 +0000
To: Maciej Stachowiak <mjs@apple.com>
Cc: Filip Pizlo <fpizlo@apple.com>, Dean Jackson <dino@apple.com>, Kenneth Russell <kbr@google.com>, Corentin Wallez <cwallez@google.com>, David Neto <dneto@google.com>, "Myles C. Maxfield" <mmaxfield@apple.com>, public-gpu@w3.org
Message-ID: <CANxMeyBS1f3MewXtoo0BaUO3rSgtY=mmw=ZzxeMWwxv1oWXEVg@mail.gmail.com>
Merging threads.

On Wed, Aug 16, 2017 at 7:03 PM Filip Pizlo <fpizlo@apple.com> wrote:

> Memory safety is a property of wasm. You get to allocate an array called
> “linear memory”. You are allowed to attempt to grow this array. You can’t
> access outside it’s bounds. If you do, you are deterministically aborted.
>
On Wed, Aug 16, 2017 at 6:57 PM Maciej Stachowiak <mjs@apple.com> wrote:

> What exactly do you mean by "WebAssembly's IR" if not "WebAssembly's IR
> executed according to its specified semantics"? The specified semantics
> guarantee safety. The syntax alone does not, but that isn't a very
> interesting claim. Memory safety is a matter of semantics, not syntax.
>

My reason to separate the IR from the VM was to make parallels with SPIR-V
vs the SPIR-V execution environments. In wasm, both are part of the same
spec - but I think it can be split along the same boundary. Guarantees come
from the VM; optimizability comes from the IR.

On Wed, Aug 16, 2017 at 7:03 PM Filip Pizlo <fpizlo@apple.com> wrote:

> This is an important point, maybe even more important than the question of
> whether WebAssembly is secure by spec or by impl. You can give programmers
> more expressiveness and more performance if you design the language’s
> semantics to optimize the bounds checks. This observation is part of the
> genius of WebAssembly. I think that would be hard to do in SPIR-V since a
> lot of choices have already been made.
>

I absolutely agree, and I think that this is a much more useful topic to
discuss than whether SPIR-V is safe. Has SPIR-V made choices that make
bounds checking slower than it could be? I have no idea, and I don't think
there have been any concrete arguments either way.

On Wed, Aug 16, 2017 at 6:59 PM Maciej Stachowiak <mjs@apple.com> wrote:

> Shader code potentially runs in parallel with CPU code so it may take more
> than a one-time check to accomplish this.
>

This is certainly true and - depending on underlying API semantics - could
require additional synchronization to be added. But there's no way to avoid
this since we can't do anything that the backing APIs' shader languages
can't do.

-Kai


This is meaningfully materially from other IRs that do not have safety
> designed in.
>
> Regards,
> Maciej
>
>
>
> -Kai
>
> -Filip
>>
>> Based on my non-expert understanding of WebAssembly, this is how it
>> handles some of the example issues you gave:
>>
>> > Any exposure of pointer values
>> > Pointer arithmetic
>> > Read or write to any location without bounds checks
>> > Ability to read or write another process's main or video memory
>> > Ability to read or write another shader's main or video memory
>> > Ability to read or write video memory that holds the contents of a
>> webpage, even one that is same-origin with the shader
>>
>> Prevented by either (a) bounds checks (against the size of global memory)
>> in the AOT/JIT generated native code, or (b) by CPU MMU control.
>> (In a shader, of course, each resource would have its own bounds check
>> sizes. So bounds constants probably can't be baked into the AOT/JIT
>> generated code.)
>>
>> > Ability to hold a pointer or reference to an object after it may have
>> been freed, and still perform access through it
>> > Potential for race conditions which may lead to access to
>> out-of-bounds, freed, reallocated or not-yet-freed memory
>>
>> Heap allocations do not exist at the IR level (except for "request global
>> memory expansion"). malloc and free are implemented inside the wasm program.
>> Shaders do not allow "heap"/"global memory" allocations (and I don't
>> think SPIR-V provides a way to do them).
>>
>> > Function call or jump to an arbitrary addresses (jumping to known
>> functions or labels is ok)
>> > Mixing dynamically indexed data structures, and function pointers or
>> executable code in the same memory space
>>
>> WebAssembly instructions are not stored in program-accessible memory.
>> Jumps are only possible to known functions and labels. Both seem to be true
>> of SPIR-V as well.
>>
>> > Ability to execute an infinite loop without ever being interrupted by a
>> watchdog or the like
>>
>> Just like a JS infinite loop, the process can be killed or the VM can
>> generate cooperatively-terminable code. For WebGL, a watchdog is typically
>> used.
>>
>> -Kai
>>
>> On Wed, Aug 16, 2017 at 5:24 PM Maciej Stachowiak <mjs@apple.com> wrote:
>>
>>>
>>> On Aug 16, 2017, at 4:39 PM, Dean Jackson <dino@apple.com> wrote:
>>>
>>>
>>>
>>> On 17 Aug 2017, at 05:08, Maciej Stachowiak <mjs@apple.com
>>> <mjs@apple..com>> wrote:
>>>
>>> But bolting on security is not impossible; as you point out, it's been
>>> done for GLSL.
>>>
>>>
>>> Restricting GLSL for security wasn't particularly difficult. It doesn't
>>> have pointers, for example. This is another argument for a language that is
>>> slightly higher than an IR: It probably will be easier to secure.
>>>
>>>
>>> I'm not totally sure that's true. WebAssembly is an example of an IR
>>> that has memory safety designed in. A colleague pointed out to me that
>>> WebAssembly also takes a very hard stance on limiting nondeterminism: <
>>> https://github.com/WebAssembly/design/blob/master/Nondeterminism.md>.
>>> This is in contrast with SPIR-V where many things are undefined.
>>>
>>> To be clear, I am *not* saying WebAssembly should be the shader format.
>>> I think it's probably not appropriate, because its memory safety depends in
>>> part on limiting the whole WebAssembly program to a single contiguous
>>> memory region. To the best of my understanding, that would not really work
>>> for shaders. I'm just citing it as an example of a binary IR language that
>>> meets web safety requirements, because it was designed with them in mind.
>>>
>>> Java bytecode is arguably another IR designed for memory safety, though
>>> the complexity of their strategy for achieving it made Java not actually
>>> very web safe in practice.
>>>
>>>  - Maciej
>>>
>>> For the SPIR-V folks, has any implementation actually used Logical
>>> Addressing Mode? I'd also like to understand what these proposed
>>> restrictions to SPIR-V mean for compute shaders. How are they impacted by
>>> Logical Addressing Mode?
>>>
>>>
>>>
Attachments

application/pkcs7-signature attachment: S/MIME Cryptographic Signature
Received on Thursday, 17 August 2017 02:27:02 UTC