Hacker News new | ask | show | jobs
by yencabulator 529 days ago
It's amazingly bad.

> the main techinique behinds o1 is the reinforcement learining.