Hacker News
TLAi+ Benchmarks for Evaluating LLMs (github.com)
2 points by alhazrod 105 days ago