Hacker News new | ask | show | jobs
by neodypsis 787 days ago
Could something like that proposed in "Training Language Models to Generate Text with Citations via Fine-grained Rewards" [0] work for you?

0. https://arxiv.org/abs/2402.04315