| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by eskibars 724 days ago
	That's correct. We've got a blog that talks a bit about it: https://vectara.com/blog/do-smaller-models-hallucinate-more/ Some people are surprised by smaller models having the ability to outperform bigger models, but it's something we've been able to exploit: if you fine tune a small model for a specific task (e.g. reduced hallucinations on a summarization task) as Intel has done, you can achieve great performance economically.