Y
Hacker News
new
|
ask
|
show
|
jobs
by
slashdave
28 days ago
RL is more than facts. Synthetic feedback is an obvious approach. Does the model suggest code that compiles and performs well?