Y
Hacker News
new
|
ask
|
show
|
jobs
by
yencabulator
529 days ago
It's amazingly bad.
> the main techinique behinds o1 is the reinforcement learining.