Hacker News new | ask | show | jobs
by tome 913 days ago
We don't have an API in public availability yet but that's coming soon in the new year. We will be price competitive with OpenAI but much faster. Deploying Mixtral is work in progress so keep your eyes open for that too!
1 comments

Also make a long context Mistral-7B that spits 1000T/s
I'll do it if you promise to say "wow!" :D
Here you go:

https://www.youtube.com/watch?v=9c078xKGwdU

It's 850 tokes per second, so you don't have to say "wow" yet!