Hacker News new | ask | show | jobs
by jaredly 334 days ago
Giving a real-world complex ambiguous scenario to 14 different Agent LLMs