Hacker News
by matthewfelgate 495 days ago
What's a reliable benchmark for measuring "AI Distortion" in these models, allowing for consistent tracking over time and potential improvement?