Y
Hacker News
new
|
ask
|
show
|
jobs
by
lmeyerov
622 days ago
I encourage you to run evals on result quality for real b2b tasks before making these claims. Almost all of your post is measurably wrong in ways that cause customers to churn an AI product same-day.