Hacker News new | ask | show | jobs
Cascade Reward Sampling for Efficient Decoding-Time Alignment (arxiv.org)
3 points by Garcia98 701 days ago