by JKCalhoun
113 days ago
I've never understood the obsession with tokens/s. I'm fine with asking a question and then going on to another task (which might be making coffee). Even with a cloud-based LLM where the response is pretty snappy, I still find that I wander off and return when I am ready to digest the entire response.
But the real kicker here is the 90s TTFT (time to first token): you ask a question and don't see anything for a full minute and a half.