by mhio
614 days ago
Would the profiles and resulting binaries be highly CPU-specific? I couldn't find any cross-hardware notes in the original paper. The examples I'm thinking of are CPUs with vastly different L1/L2/L3 cache profiles: Epyc vs. Xeon, or maybe Zen 3 vs. Zen 5. I'm just wondering whether it looks great on a benchmark machine (and at a hyperscaler with a common hardware fleet) but might not look as great when distributing common binaries to the world. Doing profiling/optimising after release seems dicey.