Y
Hacker News
new
|
ask
|
show
|
jobs
by
artemisart
15 days ago
Hill climbing doesn't mean much but absolutely doesn't imply they cheat on benchmarks. They have more details here
https://microsoft.ai/news/introducing-mai-thinking-1/
it seems to be "RL on everything".