Hacker News new | ask | show | jobs
by attentive 93 days ago
token/sec is meaningless without thinking level. If it's fast but keeps rambling about instead of jumping on it then it can take a very long time vs low token/sec but low/none thinking.