Hacker News new | ask | show | jobs
Show HN: Hallx – Hallucination risk scoring for LLM outputs (github.com)
2 points by akadhanu 85 days ago
I got tired of LLM outputs silently failing in pipelines, so I built a small scoring layer around it.

It checks three things before your output moves forward: does it match the schema you expected is it consistent across runs does it actually align with the context you provided

Returns a confidence score and a risk level. That's mostly it.

Works with OpenAI, Anthropic, Gemini, Ollama and a few others. Sync and async both supported. It's heuristic, not a guarantee. If your context is bad, the scores will be too. Hit a star, if you found this useful.

Try now: pip install hallx

1 comments

Hey this is cool. Maybe your gh page could be improved with a few examples of your product in use?
yeah.. right now working on that.. thank you for your insights :)