Hacker News new | ask | show | jobs
by croes 762 days ago
>Needle in a Needlestack is a new benchmark to measure how well LLMs pay attention to the information in their context window

I asked GPT-4o for JavaScript code and got Python, so much for attention.

1 comments

What was your query?