Y
Hacker News
new
|
ask
|
show
|
jobs
by
mountainriver
421 days ago
This also seems to be why rejection sampling + SFT seems just as good if not better in a lot of scenarios