Hacker News
by nerdsniper 51 days ago
It would not be shocking if recent KV cache contents were used to steer future requests. Not necessarily in a “divulge customer text” way, but in a “focus on this part of the embedding space” way.