Hacker News new | ask | show | jobs
by AndrewCook71 825 days ago
This is nice, we've got more Open Source LLM Evaluation Libraries coming in more often.

We're using DeepEval (https://github.com/confident-ai/deepeval) currently. How is this different from that?

1 comments

Deepeval also uses Ragas underneath. They initially took a different approach by allowing uses to formulate test cases but we were focusing on RAGs only and creating metrics and features like synthetic test data generation for it. Now that we are doing good in the RAG category, we also want to expand to solve the greater challenge.