Hacker News
by 0cf8612b2e1e 3 days ago
Which ones fail?
2 comments

I tested DeepSeek V4 Pro, Qwen 3.6 Max, Qwen 3.7, Kimi K2.6, and MiniMax M2.7; they all fail to answer.

Curiously, MiniMax M3 answers correctly.

DeepSeek