by anotherpaulg 648 days ago

Yi-Coder scored below GPT-3.5 on aider's code editing benchmark. GitHub user cheahjs recently submitted the results for the 9b model and a q4_0 quantized version of it.

Yi-Coder results, with Sonnet and GPT-3.5 for scale:

  77% Sonnet
  58% GPT-3.5
  54% Yi-Coder-9b-Chat
  45% Yi-Coder-9b-Chat-q4_0

Full leaderboard:

https://aider.chat/docs/leaderboards/