Y
Hacker News
new
|
ask
|
show
|
jobs
by
saliagato
759 days ago
gpt-4 was indeed trained on gpt-3 instruct series (davinci, specifically). gpt-4 was never a newly trained model
1 comments
whimsicalism
759 days ago
what are you talking about? you are wrong, for the record
link
fooker
759 days ago
They have pretty much admitted that GPT4 is a bunch of 3.5s in a trenchcoat.
link
whimsicalism
759 days ago
They have not. You probably read "MoE" and some pop article about what that means without having any clue.
link
matsemann
759 days ago
If you know better it would be nice of you to provide the correct information, and not just refute things.
link
whimsicalism
759 days ago
gpt-4 is a sparse MoE model with ~1.2T params. this is all public knowledge and immediately precludes the two previous commentators assertions
link