Hacker News
BalatroBench – Benchmarking LLMs' Strategic Performance Through Games (balatrobench.com)
3 points by S1M0N38-hn 141 days ago