| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by cautiouscat 16 days ago
	I don’t agree/disagree, but why does knowing Claude vs. Codex did it become crucial? What can you do with that information?

2 comments

sheept 16 days ago

It could be more helpful for comparing model performance than just vibes or benchmarks. For example, you could run analyses to compare average line count per change or revert rate by model. Perhaps there will be a paper out in the near future that scrapes AI usage in public repos for a broader dataset.

link

sdevonoes 15 days ago

We don’t want that

link

munchler 16 days ago

If, say, a certain version of Claude tends to be better at front-end than back-end work, that can be important for deciding how to use it in the future. Just like when managing human developers.

link