Hacker News new | ask | show | jobs
by jimmyl02 943 days ago
One small difference is that the GPT architecture is just the decoder stack of the original transformer as opposed to the full encoder decoder stack in the original.

I agree the branding play on GPTs in general is pretty smart and strong from OpenAI though.

1 comments

Honestly i feel like the fact that everyone is just calling LLM's GPT at this point doesn't really help OpenAI, ChatGPT would, but the fact is that unlike "googling" something became synonymous for searching on the internet, GPT != OpenAI-ing something, GPT just became what people call LLM's it seems like lately, the fact the term isn't the name of the company or the full name "chatgpt-ing" sort of breaks that hold i feel like.
Most of the newer LLMs that have come out are GPTs though.