by janwas, 592 days ago
FWIW, I ran a quick test of gemma.cpp on an M3 Pro with 8 threads. PaliGemma inference speed was similar to an older AMD (Rome or Milan), also with 8 threads. But the AMD has more cores than that, and more headroom :)