Y
Hacker News
new
|
ask
|
show
|
jobs
by
eoravkin
1042 days ago
Thanks a lot! We put a lot of energy into solving hallucination. The tldr is that we have a eval set to test hallucination and tweak stuff to optimize for performance on this eval set.