Y
Hacker News
new
|
ask
|
show
|
jobs
by
osti
54 days ago
Yeah... I would definitely call 2t/s unusable. For simple chats, I'd want at least 15 t/s. For agentic coding (which this model is advertised for), I'd want good prefill performance as well.