by ashtonsix
244 days ago
Oh right, that's sensible enough. It makes total sense to parallelise across multiple cores. I wouldn't expect a strictly linear speed-up because of contention on the memory bus, but it's not so bad that throughput flat-lines after engaging 2-3 cores. On most AWS Graviton instances you should be able to pull ~5.5 GB/s per core even with all cores active, and that becomes less of a bottleneck once you consider that you'll typically run a sequence of compute operations between memory round-trips (not just delta).
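To make the "sequence of compute operations between memory round-trips" point concrete, here's a rough sketch of what I mean: split the input into chunks, hand one chunk to each worker, and have each worker do delta plus a second step (zigzag here, picked purely for illustration) in a single pass over its chunk. The function names, the chunking scheme and the choice of zigzag are my own assumptions, not anything from a particular library. It's in Python only to show the structure; the GIL means these threads won't give a real speed-up, so treat it as pseudocode for what you'd write with native threads and SIMD.

```python
from concurrent.futures import ThreadPoolExecutor


def delta_zigzag_chunk(values, start, end):
    """Delta-encode values[start:end] and zigzag-encode each delta in one pass."""
    out = []
    # The first delta in a chunk needs the last value of the previous chunk.
    prev = values[start - 1] if start > 0 else 0
    for i in range(start, end):
        d = values[i] - prev
        prev = values[i]
        # Zigzag (assuming deltas fit in 64 bits): maps 0, -1, 1, -2, ... to 0, 1, 2, 3, ...
        out.append((d << 1) ^ (d >> 63))
    return out


def parallel_delta_zigzag(values, workers=4):
    """Split the input into contiguous chunks and process one chunk per worker."""
    n = len(values)
    step = max(1, -(-n // workers))  # ceiling division, at least 1
    bounds = [(s, min(s + step, n)) for s in range(0, n, step)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(lambda b: delta_zigzag_chunk(values, *b), bounds)
        # Chunks come back in order, so concatenating them gives the full result.
        return [x for part in parts for x in part]
```

Each value gets loaded once and goes through both steps before anything is written back, which is why the per-core memory bandwidth matters less as you chain more work onto each load.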