|
|
|
|
|
by findjashua
110 days ago
|
|
failed the car wash test. i think instead of postiioning as a general purpuse reasoning model, they'd have more success focusing on a specific use case (eg coding agent) and benchmark against the sota open models for the use case (eg qwen3-coder-next) |
|