Hacker News new | ask | show | jobs
by sheept 21 days ago
It could be more helpful for comparing model performance than just vibes or benchmarks. For example, you could run analyses to compare average line count per change or revert rate by model. Perhaps there will be a paper out in the near future that scrapes AI usage in public repos for a broader dataset.
1 comments

We don’t want that