Hacker News
by tekne, 3 hours ago
The raw pretrained models make the errors, I believe -- we then reinforcement-learn them out.
Tomte, 46 minutes ago
That's interesting! Do you have a paper or blog post at hand that shows examples of raw and RL'ed output?