|
|
|
|
|
by impulser_
12 days ago
|
|
Because there is literally nothing special about coding hardnesses. The models are doing all the lifting. It just user experience that separates them. A coding hardness with just bash outperforms Codex, Claude Code, OpenCode, Pi ect. The added features are just user experience features. |
|
https://www.endorlabs.com/research/ai-code-security-benchmar...
There's a lot of ways to configure agents and any implicit configuration to harnesses may have a non-trivial effect.