Hacker News new | ask | show | jobs
by ipsum2 453 days ago
They're not good models. They over fit to LMArena leaderboard, but perform worse in real life scenarios compared to their competitors.

The exceptions are auto regressive image generation and audio models.