Hacker News
by aesthesia 41 days ago
Calling the AISLE experiment a "benchmark" is generous. They tested three code snippets on each model.