


	by VadimPR 1215 days ago
	How good is the quality of this? BLOOM is a 176B parameter model, but it doesn't seem to compare to GPT-3 (175B parameters) in terms of output quality.

2 comments

lossolo 1215 days ago

It's because BLOOM is undertrained: you can prune a lot of weights in BLOOM without impacting performance. Look at the Chinchilla paper [1], where a 70B model outperforms the 175B GPT-3 model.

[1] https://arxiv.org/abs/2203.15556
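
To make the "undertrained" point concrete, here is a rough back-of-the-envelope sketch comparing training tokens per parameter. The token counts are approximate figures as reported in the respective papers and model cards (treat them as assumptions, not exact values), and the "about 20 tokens per parameter" target is only a commonly quoted rule of thumb derived from the Chinchilla results, not a number stated in this thread.

```python
# Approximate (parameters, training tokens) per model.
# ASSUMPTION: these are rounded figures from the papers/model cards,
# not values given anywhere in this thread.
MODELS = {
    "GPT-3": (175e9, 300e9),
    "BLOOM": (176e9, 366e9),
    "Chinchilla": (70e9, 1.4e12),
}

# ASSUMPTION: rule-of-thumb compute-optimal ratio often derived from Chinchilla.
RULE_OF_THUMB_TOKENS_PER_PARAM = 20


def tokens_per_param(params: float, tokens: float) -> float:
    """Return how many training tokens the model saw per parameter."""
    return tokens / params


def report() -> dict:
    """Map each model name to its tokens-per-parameter ratio."""
    return {name: tokens_per_param(p, t) for name, (p, t) in MODELS.items()}


if __name__ == "__main__":
    for name, ratio in report().items():
        share = ratio / RULE_OF_THUMB_TOKENS_PER_PARAM
        print(f"{name}: {ratio:.1f} tokens/param ({share:.0%} of rule of thumb)")
```

Under these assumed numbers, GPT-3 and BLOOM both land at roughly 2 tokens per parameter, while Chinchilla sits near 20, which is one way to read the claim that the larger models saw far too little data for their size.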


Der_Einzige 1215 days ago

In general, most giant LLMs are extremely undertrained at this time. Consider that most of the gains in RoBERTa vs. BERT came from just continuing to train.


stevenhuang 1215 days ago

Cases of undertraining can be observed whenever the output is repeated gibberish or loops. This happened a lot in the GPT-2 AI Dungeon days.


leobg 1214 days ago

So can we continue training RoBERTa to get it to, say, GPT-3 Ada level?


rnosov 1215 days ago

Out of curiosity, how did you measure their respective performances? My understanding is that BLOOM is roughly comparable to GPT-3 in performance on most NLP tasks. Were you comparing OpenAI davinci to raw BLOOM by any chance?


VadimPR 1203 days ago

I compared ChatGPT to BLOOM, which I know doesn't benefit from RLHF.
