by chaoz_ 163 days ago
I extracted text-based quests from Space Rangers 2 (a 2004 Russian RPG) and tested Claude Opus, GPT-5.2, and Gemini on them.

Repo: https://github.com/NickKuts/llm-game-evals