Hacker News new | ask | show | jobs
Prompt caching but for RL – 7.5x speedup on long-prompt/short-response workloads (castform.com)
4 points by kumama 44 days ago