Hacker News new | ask | show | jobs
by jchook 602 days ago
I updated the title to say GPT-4, but I believe the quality is still surprisingly close to 4o.

On HumanEval, I see 90.2 for GPT-4o and 89.0 for DeepSeek v2.5.

- https://blog.getbind.co/2024/09/19/deepseek-2-5-how-does-it-...

- https://paperswithcode.com/sota/code-generation-on-humaneval