|
|
|
|
|
by jbellis
59 days ago
|
|
Isn't the "KV Compression Strategies (FAIR)" chart showing that the fancy complex algorithm only barely beats simple topk? The commentary says that topk "degrades rapidly at low ratios" but the same can be seen for HAE (Entropy + OLS). |
|
That said, the gains are modest right now, this is still a research prototype exploring the tradeoff, and there’s clearly more work to be done.