| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by sheept 21 days ago
	It could be more helpful for comparing model performance than just vibes or benchmarks. For example, you could run analyses to compare average line count per change or revert rate by model. Perhaps there will be a paper out in the near future that scrapes AI usage in public repos for a broader dataset.

1 comments

We don’t want that