Hacker News new | ask | show | jobs
EnvTrace: Simulation-Based Semantic Evaluation of LLM Code (arxiv.org)
1 points by amscotti 220 days ago