Hacker News new | ask | show | jobs
NoLiMa: Long-Context Evaluation Beyond Literal Matching (arxiv.org)
1 points by hexhowells 486 days ago