Hacker News new | ask | show | jobs
by hackerlight 841 days ago
Their smallest model outperforms GPT-4 on Code. I'm sceptical that it'll hold up to real world use though.
1 comments

Just a note that the 67.0% HumanEval figure for GPT-4 is from its first release in March 2023. The actual performance of current ChatGPT-4 on similar problems might be better due to OpenAI's internal system prompts, possible fine-tuning, and other tricks.