|
|
|
|
|
by batperson
13 days ago
|
|
Before they removed it, I was using groq Kimi K2 model for a chat bot in small community site/chat. It was really good, seemed to have incredibly vast general world knowledge and the fast speed (400tok/s if I remember right) meant that chat users got a response instantly which was a much better experience compared to other SOTA models at the time. On the bright side it looks like Cerebras might be serving Kimi K2.6 at 1000tok/s soon https://www.cerebras.ai/blog/cerebras-kimi-k2-Enterprise |
|
I'm also looking forward for the Cerebras Kimi K2.6 release, which should be even better at 1000 tps. It is hard to overstate how important speed is for programming. Instead of having to wait for a few minutes until a task is done, it is just done instantly, and you don't have to context switch from whatever else you were working on while waiting.
I hope they will make it available to regular customers.