Y
Hacker News
new
|
ask
|
show
|
jobs
by
antirez
38 days ago
DS4 can process 460 prompt tokens per second. Not stellar but not so slow. On M3 max. See the benchmarks on readme.