Hacker News new | ask | show | jobs
by jyu 698 days ago
LLM evals = unit tests

if your outputs are consumer facing, might want to red team too

this is good for thinking about how and why of evals https://hamel.dev/blog/posts/evals

for tooling i found promptfoo to be lightweight and easy to get started.