| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by runeblaze 199 days ago
	Is it though? There is a reason gpt has codex variants. RL on a specific task raises the performance on that task

1 comments

jjmarr 199 days ago

Post-training doesn't transfer over when a new base model arrives so anyone who adopted a task-specific LLM gets burned when a new generational advance comes out.

link

runeblaze 199 days ago

Resouce-affording, if you are chasing the frontier of some more niche task you redo your training regime on the new-gen LLMs

link