Hacker News
by spatular, 100 days ago
There is also prompt processing (pp), which is compute-bound, and for agentic workflows it can matter more than token generation (tg), especially if the model is not a "thinking" model.
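
To put rough numbers on that, here is a back-of-the-envelope sketch. Every figure in it is a made-up assumption for illustration (a long agentic context, a short non-thinking reply, and hypothetical pp/tg speeds), not a measurement of any real hardware or model, and `time_split` is just a helper name invented for this example.

```python
def time_split(prompt_tokens, output_tokens, pp_tok_per_s, tg_tok_per_s):
    """Return (prompt_seconds, generation_seconds) for a single request."""
    pp_seconds = prompt_tokens / pp_tok_per_s   # prompt processing time
    tg_seconds = output_tokens / tg_tok_per_s   # token generation time
    return pp_seconds, tg_seconds


# Hypothetical agentic step: large context in, short tool call out.
pp_seconds, tg_seconds = time_split(
    prompt_tokens=40_000,   # assumed context size
    output_tokens=200,      # assumed reply length for a non-thinking model
    pp_tok_per_s=500.0,     # assumed prompt processing speed
    tg_tok_per_s=20.0,      # assumed token generation speed
)

print(f"prompt processing: {pp_seconds:.0f} s")
print(f"token generation:  {tg_seconds:.0f} s")
```

With these assumed numbers the prompt takes 80 s and the reply 10 s, so pp dominates. A thinking model that emits thousands of reasoning tokens would shift the balance back toward tg.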