Hacker News
Avatarl: Training language models from scratch with pure RL
(tokenbender.com)
2 points by krkartikay 309 days ago