Hacker News
by
whimsicalism
762 days ago
Increasingly convinced that nobody on the public internet knows how to do actual LLM evaluations.
1 comment
tedeh
762 days ago
I'm just glad that we are finally past the "Who was the 29th president of the United States" and "Draw something in the style of Van Gogh" LLM evaluation tests that everyone ran in 2022-2023.