Hacker News
Avatarl: Training language models from scratch with pure reinforcement learning (tokenbender.com)
2 points by haneefmubarak 307 days ago