by ethbr1 132 days ago
Isn't that just RL with extra power-intensive steps? (An entire model chugging away in the goal function)

That's correct, but if successful you'd essentially have updated the LLM's knowledge and capabilities "on the fly".
Maybe we could run loads of that nature off-peak, when power is cheaper. Call it dreaming. ;)