by sockgrant 56 days ago
LLM evals are well established. Are they not applicable here?