Hacker News new | ask | show | jobs
OPRD: On-Policy Representation Distillation (arxiv.org)
2 points by berlianta 13 days ago