Hacker News new | ask | show | jobs
Show HN: Zbench, RAG evals using chess Elo ratings (github.com)
3 points by ghita_ 329 days ago