Hacker News
by
croes
762 days ago
>Needle in a Needlestack is a new benchmark to measure how well LLMs pay attention to the information in their context window
I asked GPT-4o for JavaScript code and got Python. So much for attention.
1 comment
kolinko
762 days ago
What was your query?