Hacker News new | ask | show | jobs
Proposed New Test of AI Capabilities:)
1 points by VikRubenfeld 46 days ago
When Microsoft can turn a fleet of LLMs loose on the Azure UX, and Google can do the same for the Google Adwords UX, and reduce their level of dreadfulness substantially, it will go a long way to showing that frontier models are as good as claimed.