Hacker News new | ask | show | jobs
by jorisw 4 hours ago
I like the Possible Hallucinations feature. Seems like a feature that could stand on its own. Interested in how you separate those out.
2 comments

Thanks! It is part of the clustering step, I tell the model to make a judgement of whether something is inanimate or hallucinated (as defined by low support from only non-frontier models / judgement). I iterated on this a lot and made an eval set out of my LinkedIn contacts where I run GPT5.5 with web search and xhigh reasoning to determine pseudo ground truth. I tuned this to be higher recall (more things classified as non-hallucination) but it is definitely not perfect
In my case the possible hallucination was the only one that was 100% factual.