| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by maciejgryka 763 days ago
	Isn't "two models slapped together" basically how all of these things work, starting with CLIP? Not sure about GPT4o, obviously, I don't think they released any underlying architecture details?

2 comments

Your understanding is correct, even GPT4o will have an encoder model.

what even is “a model”? I’m not sure there is a technical definition that corresponds to how it’s used by the tech public

- single interconnected neural network (LLM attention layers break this, autoencoders complicate this)

- single training pass (LLMs have multiple passes, GANs have a single but produce multiple models)

> - single training pass (LLMs have multiple passes, GANs have a single but produce multiple models)

LLMs have multiple passes? wdym?