Hacker News
MLA: K/V cache compression with low-rank projection (huggingface.co)
1 point by samber 367 days ago