Hacker News new | ask | show | jobs
by dakshgupta 180 days ago
We weren’t able to find a good quality measure. LLM-as-judge dint feel right. You’re correct that without that the data is interesting but not particular insightful.