by nl
145 days ago
To quote you:

> There is no RL for programming languages.

and

> Either RL works & you have evidence

This is just so completely wrong, and here is the evidence. I think everyone in this thread is just surprised you don't seem to know this. Haven't you seen the hundreds of job ads for people to write code for LLMs to train on?