|
|
|
|
|
by haberman
2022 days ago
|
|
I am trying to wrap my head around whether ARM's looser memory model is a fundamental performance advantage or not. I had always assumed that the looser memory model must have a performance benefit. But this comment from last week argues that it doesn't really buy that much, and that a bigger buffer can eliminate most of the difference: https://news.ycombinator.com/item?id=25263461 If TSO forces flushing of store buffers for every atomic access, that seems like a substantial disadvantage for x86. |
|
Without that the store buffers can be kept unflushed to, as an example, see if one can get a full cacheline or whatnot and only flush then.
The comment is correct that an X86 with heavy reordering backend will beat arm without one. However arm with one does handily beat X86 with one. Case in point: M1