Hacker News new | ask | show | jobs
by maciejgryka 763 days ago
Isn't "two models slapped together" basically how all of these things work, starting with CLIP? Not sure about GPT4o, obviously, I don't think they released any underlying architecture details?
2 comments

Your understanding is correct, even GPT4o will have an encoder model.
what even is “a model”? I’m not sure there is a technical definition that corresponds to how it’s used by the tech public

- single interconnected neural network (LLM attention layers break this, autoencoders complicate this)

- single training pass (LLMs have multiple passes, GANs have a single but produce multiple models)

> - single training pass (LLMs have multiple passes, GANs have a single but produce multiple models)

LLMs have multiple passes? wdym?