Hacker News
NoLiMa: Long-Context Evaluation Beyond Literal Matching (arxiv.org)
2 points by fovc 459 days ago