Hacker News
by lmeyerov 622 days ago
I encourage you to run evals on result quality for real B2B tasks before making these claims. Almost all of your post is measurably wrong in ways that cause customers to churn from an AI product the same day.