|
|
|
|
|
by ofermend
341 days ago
|
|
One of the biggest challenges in RAG Evaluation is the assumption that you somehow can get the "source of truth" generated, specifically the set of "golden answers" (or golden chunks/documents).
In practice that is extremely difficult and non scalable.
Open-RAG-Eval is a new open source project that aims to address that via reference-free evaluation such as UMBRELA and AutoNuggetizer scores. Repo: https://github.com/vectara/open-rag-eval
and a nice UI to use this with: openevaluation.ai Would love to hear feedback on this after you try it out and what you might want to see on the roadmap. |
|