by vlovich123
408 days ago
I just tried giving it a code snippet that has a bug. ChatGPT and Claude found the bug instantly. Mercury failed to find it even after several reprompts (it's hallucinating). On the upside, it is significantly faster. That's promising, since the edge for ChatGPT and Claude is in the prolonged time and energy they've spent building training infrastructure, tooling, datasets, etc. to pump out models with high task performance.

That's part of the reason to compare against older, smaller models, since they're at a more comparable stage of development.