Y
Hacker News
new
|
ask
|
show
|
jobs
by
the_gipsy
68 days ago
With AIs, it seems like there never is a comparison that is useful.
2 comments
theptip
68 days ago
You can build evals. Look at Harbor or Inspect. It’s just more work than most are interested in doing right now.
link
jascha_eng
68 days ago
yup its all vibes. And anthropic is winning on those in my book still
link