Hacker News new | ask | show | jobs
by lurkshark 107 days ago
There are a few “updating” benchmarks out there. I periodically take a look at these two:

https://swe-rebench.com/

https://livebench.ai/