| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by great_psy 889 days ago
	This a pretty strong claim with zero data to back it up

2 comments

eightysixfour 889 days ago

Every small model that has outperformed GPT-4 has proven to be an overfit, so I would say it is the obvious claim, and any claim opposite that is what we should be skeptical of.

link

anon373839 888 days ago

With the exception of task specialization. Fine-tuning a small model such as Mistral 7B on a specific set of tasks can outperform using GPT-4 on those tasks, and with cheaper and faster inference.

link

eightysixfour 888 days ago

Not on the leaderboards mentioned here. That’s my point, you can overfit for specific tasks, you can’t beat them on multi-task leaderboards without training on the test data.

link

lxe 889 days ago

While I lack specific data, my intuition is based on observed trends in AI model development. I believe some other models that claimed such numbers excelled in benchmarks but fell short in real-world applications. Further research can validate this claim, and I welcome a balanced discussion.

link

travisporter 889 days ago

It does seem incredible that chatgpt has so much expertise in literally everything. Does this mean you can beat chatgpt by creating smaller "experts" and directing questions to each?

link

great_psy 888 days ago

See mixture of experts. It’s likely what chatGPT does in the backend.

link