Hacker News new | ask | show | jobs
Compare sota LLMs on web dev tasks (web.lmarena.ai)
5 points by tizkovatereza 559 days ago
1 comments

WebDev Arena lets you compare models like Claude Sonnet, GPT-4o, Qwen, or Gemini for free.

It compares the models on web dev tasks of your chice (e.g. "create a productivity tracker app" or "make a data analyst dashboard"), lets you vote for the best result, and then reveals which model created which result.

It's built with E2B as the runtime for generating the apps by executing the LLM's code in the sandbox environment.