| HN Mirror



	by applgo443 1486 days ago
	I work on building language models at FAANG for my day-to-day job, and I'm very curious about this: how does fine-tuning on T5 remove hallucination? Can you elaborate? This is an amazing product, btw. Let me know if you're looking for people to hire :)

1 comment

rushingcreek 1486 days ago

We don't use T5 exactly; we use a derivative that has a similar (but not identical) architecture and is also pre-trained differently. That model, combined with our factual generation dataset and clever prompt engineering, seems to be the secret sauce for reducing hallucination.

And thank you :) it's comments like this that really fire us up
