Hacker News
by BizarroLand 194 days ago
I would assume that it is testing how well and appropriately the LLM responds to prompts.