Hacker News new | ask | show | jobs
by Certhas 423 days ago
If this was just the effect you mention you would not expect the base model to surpass the RL model though. Plus their k are much smaller than that.

I think it's a very interesting and meaningful study.