| HN Mirror



	by moralestapia 677 days ago
	> the quintessential language model task of predicting the next word?

	Based on what? The whole test is flawed because of this. Even different LLMs would choose different answers, and there's no objective argument for which one is best.


sorokod 677 days ago

The one provided in the original post.


moralestapia 677 days ago

I don't see any of that.

Quote?


JoelEinbinder 677 days ago

The prompts you see in the quiz are from real Hacker News comments. Whatever word the commenter said next is the "correct" word.
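
To make that scoring rule concrete, here is a minimal sketch of how one quiz item could be built from a comment. It is not the quiz's actual code: the function names `make_quiz_item` and `is_correct`, the whitespace tokenization, and the case-insensitive comparison are all assumptions made for illustration.

```python
from typing import Tuple


def make_quiz_item(comment: str, split_at: int) -> Tuple[str, str]:
    """Split a comment into a prompt and the word that actually followed it.

    Hypothetical helper: words are separated on whitespace, which is an
    assumption; a real quiz might tokenize differently.
    """
    words = comment.split()
    if not 0 < split_at < len(words):
        raise ValueError("split_at must leave at least one prompt word and one answer word")
    prompt = " ".join(words[:split_at])
    answer = words[split_at]  # the word the commenter really wrote next
    return prompt, answer


def is_correct(guess: str, answer: str) -> bool:
    """Compare a guess to the commenter's word (assumed: ignore case and outer spaces)."""
    return guess.strip().lower() == answer.lower()


prompt, answer = make_quiz_item(
    "Whatever word the commenter said next is the correct word.", 5
)
print(prompt)
print(is_correct("next", answer))
```

Under this scheme there is exactly one accepted answer per prompt, the commenter's own word, even when other continuations would read just as naturally.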


moralestapia 677 days ago

This is what I see:

  Are you smarter than a language model?

  There are a lot of benchmarks that try to see how good language models are at human tasks. But how good are you at the quintessential language model task of predicting the next word?

And then a list of questions.

How am I supposed to know it has anything to do with HN?


JoelEinbinder 677 days ago

After the quiz, the source is linked along with the full comment.
