|
|
|
Show HN: EvalsHub: Your AI is failing in production and you don't know it
(evalshub.ai)
|
|
4 points
by neilsharma425
89 days ago
|
|
I was tired of stitching together Langfuse for tracing, promptfoo for red teaming and evals, and custom scripts for CI/CD. It was a mess so I built EvalsHub. EvalsHub does all of it in one place. Automatic production scoring, red teaming, prompt versioning, and CI/CD integration. Zero to full eval coverage in 30 minutes. Would love brutal feedback from anyone shipping AI in production. evalshub.ai |
|