by whimsicalism 762 days ago
Increasingly convinced that nobody on the public internet knows how to do actual LLM evaluations.
1 comment

I'm just glad we are finally past the "Who was the 29th president of the United States?" and "Draw something in the style of Van Gogh" LLM evaluation tests everyone ran in 2022-2023.