Hacker News new | ask | show | jobs
by Lerc 70 days ago
To what extent do you feel the harness contributes relative to the model?

To put another way, how much inferior can the model be with a superior harness to achieve a similar result?

2 comments

Pi doesn't claim to get better results. It is conceptually simpler, smaller, and more transparent to the end user than most harnesses. It's as much about the things it doesn't do as about what it can do.
Significantly! See this recent post „Compare harnesses not models: Blitzy vs GPT-5.4 on SWE-Bench Pro” https://quesma.com/blog/verifying-blitzy-swe-bench-pro/