|
|
|
|
|
by capevace
124 days ago
|
|
Seems like the industry is moving further towards having low-latency/high-speed models for direct interaction, and having slow, long thinking models for longer tasks / deeper thinking. Quick/Instant LLMs for human use (think UI).
Slow, deep thinking LLMs for autonomous agents. |
|
Slow, deep tasks are mostly for flashy one-shot demos that have little to no practical use in the real world.