|
|
|
|
|
by geocar
2249 days ago
|
|
> The vector instructions can't really be farmed out because they can be scattered inline with regular scalar code. If you believe this, you won't believe what's in this box[1]. [1]: https://www.sonnettech.com/product/egfx-breakaway-puck.html > A memcopy of a small to medium-sized struct might be compiled into a bunch of 128bit mov for example and then immediately working on that moved struct I'm not sure that's true: rep movs is pretty fast these days. |
|
There's a fundamental difference between GPU code and vector CPU instructions, though. GPU shader instructions aren't interwoven with the CPU instructions.
Yes, if you restrict yourself to not arbitrarily mixing the vector code with the non-vector code, you can put the vector code off in a dedicated processor (GPU in this case). The GP explicitly stated that a lack of this restriction prevents efficiently farming it off to a coprocessor.