Hacker News new | ask | show | jobs
by spatular 100 days ago
There is also prompt processing that's compute-bound, and for agentic workflows it can matter more than tg, especially if the model is not of "thinking" type.