Hacker News new | ask | show | jobs
by goodside 993 days ago
That's true, but it was expensive and until recently you could only tune older versions of GPT-3 lacking both instruction tuning and the code pre-training of the Codex models (from which GPT-3.5 is thought to descend). You had to want tuning so badly you were willing to 6x the token cost and go back in time 2 years.