Hacker News new | ask | show | jobs
by p1esk 855 days ago
I haven’t read this paper but what you described is commonly done (look up top-k or top-p sampling and beam search as examples).