Hacker News new | ask | show | jobs
by spott 316 days ago
Groq is offering 1k tokens per second for the 20B model.

You are unlikely to match groq on off the shelf hardware as far as I'm aware.