| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jldugger 1195 days ago
	OpenAI is treating GPT as a "foundational model". They spend time training the foundational model, then build on top of that. GPT was published may 2020. GPT 3.5 ("text-davinci-003" and "code-davinci-002") shipped a year ago, and ChatGPT was just a fine tuned on top of those. So they've had plenty of time to increase the training set, improve the architecture and run GPUs full power to get a GPT-4.