Hacker News
by MeetingsBrowser 577 days ago
Are there any concrete benchmarks for comparing models for different types of programming tasks?
1 comment

Not that I know of, but that's something we definitely need.