Hacker News
by jhp123 1131 days ago
LLMs also hallucinate during summarization tasks, adding topics that were not in the original

I've built internal systems that do summarization based on knowledge retrieval systems for specific nonpublic corporate information.

With GPT-4, I see very little hallucination. It very rarely deviates from the source material, and every time I've found something unexpected, it traced back to a problem in the source material provided to the model.

"very little" is still an unacceptable amount for most fields.

Quantify "very little": over what time period, across which variations of use, with what failure states, and at what sample size?
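
One rough way to put a number on it, as a sketch only: treat each reviewed summary as a pass/fail trial and report a Wilson score interval for the hallucination rate instead of a bare percentage. The counts below (2 failures in 200 reviewed summaries) are made up for illustration and are not figures from anyone in this thread, and `wilson_interval` is a hypothetical helper name.

```python
import math


def wilson_interval(failures: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z = 1.96 is roughly 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = failures / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    # Clamp to [0, 1] to guard against floating-point drift at the edges.
    return max(0.0, center - half), min(1.0, center + half)


# Hypothetical review: 2 hallucinated summaries found among 200 checked by hand.
failures, reviewed = 2, 200
low, high = wilson_interval(failures, reviewed)
print(f"observed rate: {failures / reviewed:.1%}, 95% interval: {low:.1%} to {high:.1%}")
```

The point of the interval is that the same observed rate means very different things at different sample sizes: zero failures in 100 reviews still leaves an upper bound of a few percent, which may or may not be acceptable depending on the field.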