|
|
|
|
|
by maxalbarello
108 days ago
|
|
Also wondering how to evals agentic pipelines. For instance, I generated memories from my chatGPT conversation history, how do I know whether they are accurate or not? I would like a single number that I would use to optimize the pipeline with but I find it hard to figure out what that number should be measuring. |
|