| Hey HN, After watching too many agents confidently lie in production, I built Director-AI. It sits between your LLM and the user, scoring every generated token with:
• 0.6× DeBERTa-v3 NLI (contradiction detection)
• 0.4× RAG against your own ChromaDB knowledge base If coherence < threshold → Rust kernel halts the stream before the token is sent. Key technical bits:
• Works with any OpenAI-compatible endpoint (Ollama, vLLM, llama.cpp, Groq, OpenAI, Claude…)
• StreamingKernel + windowed scoring
• GroundTruthStore.add() for easy fact ingestion
• Dual licensing: AGPL open + commercial (closed-source/SaaS OK) Honest AggreFact numbers inside (66.2% balanced acc with streaming enabled). Not claiming SOTA on static NLI — the value is in the live gating + custom KB system. Repo + full examples: https://github.com/anulum/director-ai Would love feedback on the scoring weights, halt logic, or kernel design. What hallucination problems are you solving today? |
I shipped *v1.2.0* overnight with everything you asked for:
• Full end-to-end benchmark notebook (600+ real RAG/agent traces, HaluEval + TruthfulQA, head-to-head vs Claude self-critique, latency, false positives, recovery rate) → notebooks/04_end_to_end_benchmark.ipynb
• Rich evidence on every halt: top-K conflicting chunks + highlighted NLI premise/hypothesis + distances (now in HaltEvent + dashboard)
• Ready-made graceful fallbacks (soft warning, retrieval-only retry, partial+correction) → examples/graceful_fallbacks.py
• Live Hugging Face Spaces demo (try it yourself): https://huggingface.co/spaces/anulum/director-ai-guardrail
• Full MkDocs site (22 pages), native OpenAI/Anthropic interceptors, score caching, 8-bit NLI, bge-large, LangGraph/Haystack/CrewAI support
Repo: https://github.com/anulum/director-ai Changelog: https://github.com/anulum/director-ai/releases/tag/v1.2.0
Would love feedback on the new bits — especially the end-to-end numbers and graceful patterns. Fire away!