|
|
|
|
|
by InitialPhase55
82 days ago
|
|
Curious, how did you settle on Haiku/Sonnet? Because there are much cheaper models on OpenRouter that probably perform comparatively... Consider Haiku 4.5: $1/M input tokens | $5/M output tokens
vs MiniMax M2.7: $0.30/M input tokens | $1.20/M output tokens
vs Kimi K2.5: $0.45/M input tokens | $2.20/M output tokens I haven't tried so I can't say for sure, but from personal experience, I think M2.7 and K2.5 can match Haiku and probably exceed it on most tasks, for much cheaper. |
|
It's getting some organic usage -- 100M input tokens for just chats this month -- and I've seen enough users try to throw Haiku against the wall and failing to trick it into misbehaving. It "pumps the breaks" a lot and imitates annoyance when you ask it repeatedly :) Handles emotionally driven real-life questions mid-conversation well. It just works.
Not seeing all that consistently with other models I've tried so far -- but I've assumed it's not a completely fair comparison with (e.g.) open weights, since these safety rails are presumably not always arising from the natural model calls.