Y
Hacker News
new
|
ask
|
show
|
jobs
by
happycube
306 days ago
Maybe it's the exposure to Chinese? I've heard that training models on code first helps, so I could see it.
I've also heard hearsay that R1 is quite clever in Chinese, too.