|
|
|
|
|
by verdverm
201 days ago
|
|
ADK has a few pages and some API for evaluating agentic systems https://google.github.io/adk-docs/evaluate/ tl;dr - challenging because different runs produce different output, also how do you pass/fail (another LLM/agent is what people do) |
|