Hacker News new | ask | show | jobs
by eoravkin 1042 days ago
Thanks a lot! We put a lot of energy into solving hallucination. The tldr is that we have a eval set to test hallucination and tweak stuff to optimize for performance on this eval set.