|
|
|
|
|
by adi4213
20 days ago
|
|
> There's just not a lot of juice left to squeeze for Gemini to tell you exactly how tall Ke$ha is or when the last time Brittney Spears went to jail was... But there is a ton of juice left to squeeze when it comes to post-training/RL for a ton of useful things in practice, right?
It’s been amazing seeing how good modern model tool use is for example, and I bet there is a lot of room for improvement still (no doubt that a ton of improvement can be made more easily on the agent harness front or via post-training regimes like LoRa (which does support to your point about diminishing pre-training juice)) |
|