Y
Hacker News
new
|
ask
|
show
|
jobs
by
runeblaze
152 days ago
Is it though? There is a reason gpt has codex variants. RL on a specific task raises the performance on that task
1 comments
jjmarr
152 days ago
Post-training doesn't transfer over when a new base model arrives so anyone who adopted a task-specific LLM gets burned when a new generational advance comes out.
link
runeblaze
151 days ago
Resouce-affording, if you are chasing the frontier of some more niche task you redo your training regime on the new-gen LLMs
link