Y
Hacker News
new
|
ask
|
show
|
jobs
by
miven
754 days ago
I don't get your point, how is what you're suggesting here different from a few papers we already have on KV cache pruning methods like [1]?
[1]
https://arxiv.org/abs/2305.15805