Hacker News new | ask | show | jobs
by mountainriver 421 days ago
This also seems to be why rejection sampling + SFT seems just as good if not better in a lot of scenarios