Hacker News new | ask | show | jobs
by ej88 85 days ago
They do, in the paper they mention they evaluate the LLM without tools