Hacker News new | ask | show | jobs
Benchmarking LLMs for Web Tasks (100x.bot)
3 points by shardullavekar 20 days ago