Preinternet people would routinely re-implement unix and get shell scripts working across systems. This benchmark shows that agentic LLMs can't even do that, not just for complex programs and scripts, but for simple programs and simple scripts. 0%. Which fits with claudes' inability to write a c compiler.