Hacker News
Bootstrapping AI Evals from Context (Why 'Just Asking Claude' Fails) (scorable.ai)
1 point by Arimbr 60 days ago