Hacker News
by bearjaws 876 days ago
Together.ai seems to be the best; it's incredibly fast.
2 comments

Not so sure about that. Check out https://github.com/ray-project/llmperf-leaderboard

And try Mixtral on chat.groq.com.

These guys are much faster than OpenRouter, and their Llama 2 runs faster than gpt-3.5-turbo. Amazing work.