by vitus 79 days ago
Eh. It depends on what your bottleneck is. If the bottleneck is now, say, CPU cache contention because you've doubled your thread count, it's entirely possible that FL1 running on the new server generation is operating in a different regime than on the previous generation. You can see some hints of that happening, since doubling the thread count didn't result in a doubling of throughput.

In fact, I suspect based on the throughput doubling with FL2, we're back in the same regime as the baseline.

It would be useful to see the latency of FL2 on Gen12 compared to the baseline (FL1 on Gen12), just to confirm.
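
To make the "different regime" argument concrete, here is a rough sketch of the arithmetic: divide the observed speedup by the increase in thread count and see how far it falls from 1.0. The function name and all throughput figures are made up for illustration and are not taken from the post.

```python
def scaling_efficiency(base_throughput: float, new_throughput: float, thread_ratio: float) -> float:
    """Return the fraction of ideal linear scaling achieved.

    1.0 means throughput grew exactly as much as the thread count did;
    values well below 1.0 hint at a new bottleneck (for example, cache contention).
    """
    if base_throughput <= 0 or thread_ratio <= 0:
        raise ValueError("base_throughput and thread_ratio must be positive")
    speedup = new_throughput / base_throughput
    return speedup / thread_ratio


# Hypothetical numbers, in arbitrary units; NOT measurements from the post.
baseline = 100.0          # old stack on the old hardware
old_stack_new_hw = 150.0  # old stack with twice the threads
new_stack_new_hw = 200.0  # new stack with twice the threads

print(scaling_efficiency(baseline, old_stack_new_hw, 2.0))
print(scaling_efficiency(baseline, new_stack_new_hw, 2.0))
```

With these invented inputs, the first case lands well under 1.0 (sub-linear scaling, so probably a different bottleneck), while the second sits at 1.0, which is what "back in the same regime as the baseline" would look like. Throughput alone can't confirm that, which is why the latency comparison would still help.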

1 comments

Yes, fair points. I think it’s also indicative of how important it is that code be optimized for the specific hardware it will run on. Systems need to be considered and optimized as a whole. Still an interesting post.